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We have isolated and characterized a novel protein of pathogenic forms of Neisseria. We have also isolated and character- 
ized genes which encode PilC, i.e., the pilC loci. Portions of the DNA sequences of the pilC genes are useful as probes to diag- 
nose the presence of the relevant microorganisms containing type 4 pilin, for example. Neisseria in samples. These DNAs also 
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passive immunization. In addition, by utilizing probes derived from the DNA sequences, it is possible to isolate and sequence 
other portions of the pilC loci from species and strains of interest. 
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POLYPEPTIDES AND POLYNUCLEOTIDES USEFUL FOR THE 
DIAGNOSIS AND TREATT^NT OF PATHOGENIC NEISSERIA 



Technical Field 

The invention relates to materials and 
methodologies for managing the spread of infections 
caused by microorganisms having type 4 pilin, for 
example, Neisseria.. More specifically, it relates to 
polypeptides and antibodies useful in vaccines for the 
treatment of pathologic infections caused by these 
microorganisms* It also relates to polynucleotides 
useful for the recombinant production of these 
polypeptides. In addition, it- relates to polypeptides, 
antibodies, and polynucleotides used for the detection of 
these strains. 

Background Art 

Type 4 pilins are expressed by several 
bacterial genuses, including Neisseria, Moraxella, 
BacteroideSr and Pseudomonas . Species within these 
genuses which have pathogenic members that express type 4 
pilins are, for example, N. gonorrhoeae, N. meningitidis, 
M. bovis, B. nodosus, and P. aeruginosa. In addition, 
the Tcp pilin of V. cholerae is highly homologous to the 
type 4 pilins of other genuses. 
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The only known reservoir of the neisseriae is 
man. The genus includes two gram- negative species of 
pyogenic cocci that are pathogenic for man: the 
meningococcus {Neisseria, menlng-ltldls) and the gonococcus 
5 (Neisseria gonorrheae) . 

N. Menlng-ltldls causes a variety of infections, 
most notably, meningitis and bacteremia. Meningococci 
can be divided into serologic groups on the basis of 
agglutination reactions with immune serum. The present 

10 classification includes groups A through Z. Clinically 
significant new groups encon^ass Y and W 135. The major 
groups are remarkably heterogeneous, but subclassi- 
f ication with additional serologic markers has been 
possible. Noncapsular antigens have provided the basis 

15 for dividing strains of groups into distinct types. 

Meningococci cause either epidemic or sporadic 
disease, and historically, there has been a cyclic 
variation in the prevalence of meningococcal infection 
with peaks of increased frequency occurring every 8 to 12 

20 years and lasting 4 to 6 years. The attack rate of 

meningococcal disease is highest for children between 6 
months and 1 year. In the first half of this century, 
most epidemics of meningococcal disease in the United 
States were caused by group A organisms. In the past two 

25 decades, first group B then group C meningococci were 
responsible for outbreaks in both the military and 
civilian populations. Currently, group B is responsible 
for 50 to 55 percent of reported cases. 

Gonorrhea, which is caused by N. gonorrhea, is 

30 an infection of columnar and transitional epithelium. 

This disease is the most common reportable communicable 
disease in the United States, and also has world-wide 
prevalence - 
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Although treatment of disease caused by 
gonococci and meningococci are often treated with 
antibiotics, these microorganisms often develop 
antibiotic resistance. Thus, prevention with vaccines is 
5 a preferable mode to contain the spread of infection. 
However, for a variety of reasons, including antigenic 
variation, the development of vaccines has been greatly 
hampered. For example, a vaccine which prevents 
gonorrhea is still lacking. In addition, although 56% of 

10 the causes of meningococcal disease are caused by 

serogroup B, an effective vaccine against this serogroup 
is also lacking. 

N. gonorrheae and N. meningitidis are organisms 
completely adapted to the human host, having no other 

15 ecological niche. They have acquired a large arsenal of 
strategies to overcome the human host defense system. 

The first step in infection with pathological 
forms of these Neisseria is adherence to target cells. 
It is thought that the pili of these microorganisms are a 

20 major virulence factor. For example, it is known that in 
the case of N. gonorrheae, piliated (P"*") variants attach 
much better to susceptible cells than non-piliated (P-) 
variants (Swanson, 1973; Pearce and Buchanan, 1978) . 
Moreover, P+ variants, unlike P- variants, are able to 

25 establish an infection in human volunteers (Kellog et al, 
1968) . 

Although the pilus protein elicits an immune 
response, so many antigenic variants exist and continue 
to develop that vaccines against the pilus protein are 
30 not highly effective. 

Pilin is the major subunit of the pilus. 
Expression of pilin is controlled at the pilE locus. 
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Brief Description of the Invention 

We have isolated and characterized a novel 
protein of pathogenic forms of Neisseria, PilC, that may 
be is associated with the pili of gonococci and 
5 meningococci. We have also isolated and characterized 
genes which encode PilC, i.e., the pllC loci. 

Portions of the DNA sequences of the pllC genes 
are useful as probes to diagnose the presence of the 
relevant Neisseria in samples. These DNAs also make 
10 available polypeptide sequences of immunoreactive 

epitopes encoded within the loci, thus permitting the 
production of polypeptides which are useful as standards 
or reagents in diagnostic tests and/or as components of 
vaccines for microorganisms with type 4 pilin and 
15 containing one or more epitopes that are immunologically 
identifiable with an epitope encoded in pllC of 
Neisseria,. Antibodies, both monoclonal and purified 
polyclonal, directed against PilC epitopes are also 
useful for diagnostic tests and as therapeutic agents for 
20 passive immunization. In addition, by utilizing probes 

derived from the DNA secjuences, it is possible to isolate 
and sequence portions of the pllC loci from species and 
strains of interest. 

Accordingly, one embodiment of the invention is 
25 a recombinant polynucleotide encoding a polypeptide 
con^rised of an immunoreactive epitope of a protein 
encoded in pllC of Neisseria. 

Another embodiment of the invention is a 
recombinant expression system comprising a polynucleotide 
30 encoding a polypeptide con^rised of an immunoreactive 
epitope of a protein encoded in pllC of Neisseria, 
wherein the polynucleotide is operably linked to a 
control sequence compatible with a desired host. 



BNSOOCID: <WO 9213871A1J_> 



wo 92/13871 PCT/US92/00863 

-5- 

Yet another embodiment of the invention is 
purified polypeptide comprised of an immunoreactive 
epitope of a protein encoded in pilC of Neisseria,. 

Another embodiment of the invention is a 
5 recombinant polypeptide comprised of an immunoreactive 
epitope of a protein encoded in pilC of Neisseria. 

Still another embodiment of the invention is a 
vaccine composition for the treatment of Neisseria 
infection, comprised of a pharmaceutically acceptable 
10 excipient and of an effective amount of a recombinant 

polypeptide, wherein the polypeptide is comprised of an 
immunoreactive epitope of a protein encoded in pilC of 
Neisseria. 

Yet another embodiment of the invention is a 
15 composition comprised of purified polyclonal anti-PilC 
antibodies, wherein the PilC is of Neisseria. 

An additional embodiment of the invention is a 
composition comprised of a monoclonal antibody directed 
against an immunoreactive epitope encoded in pilC of 
20 Neisseria. 

Another embodiment of the invention is a method 
for producing antibodies to PilC of Neisseria comprising 
administering to an individual a composition comprised of 
an isolated immunogenic polypeptide containing a PilC 
25 epitope in an amount sufficient to produce an immune 
response. 

Yet another embodiment of the invention is an 
oligomer capable of hybridizing to a sequence in pilC of 
Neisseria, wherein the oligomer is comprised of a pilC 
30 sequence complementary to at least about 6 contiguous 
nucleotides of pilC. 

Still another embodiment of the invention is a 
process for detecting a pile sequence in an analyte 
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strand, wherein the pllC sequence comprises a selected 
target region, the process comprising: 

(a) providing a sample comprised of an analyte 
strand suspected of containing a selected target pllC 

5 sequence; 

(b) providing an oligomer capable of 
hybridizing to the target pllC sequence, wherein the 
oligomer is comprised of a pllC targeting sequence 
complementary to at least about 6 contiguous nucleotides 

10 of pllC; 

(c) incubating the sample of (a) with the 
oligomer of (b) under conditions which allow specific 
hybrid duplexes to form between the targeting sequence 
and the target sequence; and 

15 (d) detecting hybrids formed between the 

target sequence, if any, and the oligomer. 

Yet another embodiment of the invention is a 
recombinant polynucleotide comprising a DNA sequence of 
at least 8 contiguous nucleotides from pllC, wherein the 

20 pllC sequence is selected from the group of sequences 
shown in Figure 3, Figure 6, and Figure 7. 

Another embodiment of the invention is a method 
of treating an individual for a Neisseria, infection 
comprising administering to the individual antibodies 

25 produced according to claim 31, wherein the antibodies 
are administered in an amount effective to prevent the 
pathology of the infection. 

Brief Description of the Drawincrs 
30 Figure 1 is a genetic and physical map of pilC 

locus 1, showing the restriction enzyme sites. 

Figure 2 shows the nucleotide sequence and the 
deduced amino acid sequence of the 5' -end of pilCl. 

35 
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Figure 3 shows the nucleotide sequence of the 
sense strand of the pllCl gene. 

Figure 4 shows the shows the nucleotide 
sequence of the sense strand of the pxlCl gene and the 
5 amino acids encoded therein. 

Figure 5 shows the nucleotide sequence of the 
sense strand of the pilCl gene, and the effect of frame 
shift on the putative gene products encoded therein. 

Figure 6 shows the nucleotide sequences of PGR 
10 amplified fragments demonstrating a variation in the 

length of the G tract and sequence differences in the 5' 
region of the pilC genes. 

Figure 7 shows the DNA sequence of the 3' -end 
of the pilC2 fragment. 
15 Figure 8 shows the pilC2 fragment sequence, and 

the putative amino acids encoded therein. 

Figure 9 shows a comparison of the analogous 
portions of pilC2 (top) and pilCl (bottom) DNA sequences, 
and the putative amino acids encoded therein. 

20 

Modes for Carrying Out the Invention 

The present invention provides polypeptides, 
antibodies, and polynucleotides which are useful for the 
detection and treatment of pathogenic microorganisms 

25 having type 4 pilin, for example. Neisseria, Moraxella, 
BSLCteroides , and Pseudomonas . 

We have discovered a polypeptide, PilC, which 
is present in N. gonorrheae. This polypeptide is a 110 
kd protein that is closely associated with the pili of 

30 the microorganism. Most strains of gonorrheae carry 
two copies of the corresponding genes which encode the 
polypeptide (s) ; these genes have been denoted pilC. 
Expression from the pile loci is regulated by f recpaent 
frameshift mutations within a run of G residues in the 

35 
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region encoding the signal peptide. The two pilC genes 
of N, gonoxrrtiGcLB are not identical- Hence, alternate 
expression from either the pllCl or the pllC2 loci gives 
rise to two different forms of PilC- Among nonpiliated 
5 (P~) descendants from P"^ clones, clones were found that 
expressed pilin but not PilC. All P**" revertants from 
such Pile' non-piliated clones have regained expression 
of PilC- Hence, phase variation of gonococcal pili can 
be caused by f rameshif t mutations in pilC. Transposon 
10 inactivation of the expressed p±lC2 copy resulted in a 
nonpiliated, pilin producing revertible phenotype- It 
appears, therefore, that PilC is required for assembly of 
pilin subunits into a polymerized pilus fiber in 
gonorriieaB - 

15 We have cloned and isolated gene, pllCl, from 

gonorrhea.^. In addition, by coir5)arison of this gene 
sequence with a related sequence, we have cloned a 
fragment of the pil2 gene. Moreover, using 
polynucleotide probes derived from isolated pilCl and PCR 

20 aitplif ication, we have detected two possible variants of 
a pile gene in N. meningitidis. The sequences of pilC 
reported herein appear to be novel, in that there are no 
reported counterparts in Genbank, and no significant 
homologies were found with any available sequences in 

25 that data base. 

The useful materials and processes of the 
present invention are made possible by the provision of 
the sequences of the pilC genes from N, gonorrheas and 
from N. meningitidis . Information present in the 

30 sequences of the pilC genes allows for the design of 
polypeptides which may be useful as vaccines for 
treatment of pathogenic Neisseria, as diagnostic tools 
for the detection of these microorganisms, and as agents 
for the preparation of antibodies to PilC. In addition, 

35 
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this information allows for the design of polynucleotides 
for the recombinant production of the polypeptides 
derived from PilC, and for the design of oligomers which 
are useful as probes and primers for the detection and 
5 amplification of target regions of pilC. 

The practice of the present invention will 
employ, unless otherwise indicated, conventional 
techniques of molecular biology, microbiology/ 
recombinant DNA, and immunology, which are within the 

10 skill of the art. Such technicjues are explained fully in 
the literature. See e.g., Maniatis, Fitsch & Sambrook, 
MOLECULAR CLONING; A LABORATORY MANUAL, Second Edition 
(1989) ; DNA CLONING, VOLUMES I AND II (D.N Glover ed. 
1985); OLIGONUCLEOTIDE SYNTHESIS (M.J. Gait ed, 1984); 

15 NUCLEIC ACID HYBRIDIZATION (B,D. Hames & S.J. Higgins 

eds. 1984); TRANSCRIPTION AND TRANSLATION (B.D. Hames & 
S.J. Higgins eds. 1984); ANIMAL CELL CULTURE (R.I. 
Freshney ed. 1986) ; IMMOBILIZED CELLS AND ENZYMES (IRL 
Press, 1986); B. Perbal, A PRACTICAL GUIDE TO MOLECULAR 

20 CLONING (1984); the series, METHODS IN ENZYMOLOGY 
(Academic Press, Inc.); GENE TRANSFER VECTORS FOR 
MAMMALIAN CELLS (J.H. Miller and M.P. Calos eds. 1987, 
Cold Spring Harbor Laboratory) , Methods in Enzymology 
Vol. 154 and Vol. 155 (Wu and Grossman, and Wu, eds., 

25 respectively), Mayer and Walker, eds. (1987), 

IMMUNOCHEMICAL METHODS IN CELL AND MOLECULAR BIOLOGY 
(Academic Press, London), Scopes, (1987), PROTEIN 
PURIFICATION: PRINCIPLES AND PRACTICE, Second Edition 
(Springer-Verlag, N.Y.), and HANDBOOK OF EXPERIMENTAL 

30 IMMUNOLOGY, VOLUMES 1-IV (D.M. Weir and C. C. Blackwell 
eds 1986) . All patents, patent applications, and 
publications mentioned herein, both supra and infra, are 
hereby incorporated herein by reference. 

35 
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As used herein, a polynucleotide "derived from" 
a designated sequence refers to a polynucleotide sequence 
which is comprised of a sequence of approximately at 
least about 6 nucleotides, preferably at least about 8 
5 nucleotides, more preferably at least about 10-12 

nucleotides, and even more preferably at least about 
15-20 nucleotides corresponding to a region of the 
designated nucleotide sequence. "Corresponding" means 
homologous to or complementary to the designated 
10 sequence. Regions from which typical polynucleotide 

sequences may be "derived" include but are not limited 
to, for example, regions encoding specific epitopes, as 
well as non- transcribed and/or non- translated regions. 

The derived polynucleotide is not necessarily 
15 physically derived from the nucleotide sequence shown, 
but may be generated in any manner, including for 
example, chemical synthesis or DNA replication or reverse 
transcription or transcription. In addition, 
combinations of regions corresponding to that of the 
20 designated secjuence may be modified in ways known in the 
art to be consistent with an intended use. 

Similarly, a polypeptide or amino acid sequence 
"derived from" a designated nucleic acid sequence refers 
to a polypeptide having an amino acid sequence identical 
25 to that of a polypeptide encoded in the sequence, or a 

portion thereof wherein the portion consists of at least 
3-5 amino acids, and more preferably at least 8-10 amino 
acids, and even more preferably at least 11-15 amino 
acids, or which is immunologically identifiable with a 
30 polypeptide encoded in the sequence.. This terminology 
also includes a polypeptide expressed from a designated 
nucleic acid sequence. 

A recombinant or derived polypeptide is not 
necessarily translated from a designated nucleic acid 

35 
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sequence. It may be generated in any manner, including 
for example, chemical synthesis, or expression of a 
recombinant expression system, or isolation from a 
microorganism* A recombinant or derived polypeptide may 
5 include one or more analogs of amino acids or unnatural 
amino acids in its sequence. Methods of inserting 
analogs of amino acids into a sequence are known in the 
art. It also may include one or more labels, which are 
known to those of skill in the art . 

10 The term "recombinant polynucleotide" as used 

herein intends a polynucleotide of genomic, cDNA, 
semisynthetic, or synthetic origin which, by virtue of 
its origin or manipulation: (1) is not associated with 
all or a portion of a polynucleotide with which it is 

15 associated in nature, (2) is linked to a polynucleotide 

other than that to which it is linked in nature, (3) does 
not occur in nature, or (4) is not in the form of a 
library. 

The term "polynucleotide" as used herein refers 
20 to a polymeric form of nucleotides of any length, either 
ribonucleotides or deoxyribonucleotides . This term 
refers only to the primary structure of the molecule. 
Thus, this term includes double- and single -stranded DNA 
and RNA. It also includes known types of modifications, 
25 for example, labels which are known in the art, 

methylation, "caps", substitution of one or more of the 
naturally occurring nucleotides with an analog, 
internucleotide modifications such as, for example, those 
with uncharged linkages (e.g., methyl phosphonates , 
3 0 phosphotriesters, phosphoamidates , carbamates, etc.) and 
with charged linkages (e.g., phosphorothioates , 
phosphorodithioates, etc.), those containing pendant 
moieties, such as, for example proteins (including for 
e.g., nucleases, toxins, antibodies, signal peptides. 
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poly-L- lysine, etc.), those with intercalators (e.g., 
acridine, psoralen, etc.), those containing chelators 
(e.g., metals, radioactive metals, boron, oxidative 
metals, etc.), those containing alkylators, those with 
modified linkages (e.g., alpha anomeric nucleic acids, 
etc.), as well as unmodified forms of the polynucleotide. 

The term "purified polynucleotide" refers to a 
polynucleotide which is essentially free, i.e., contains 
less than about 50%, preferably less than about 70%, and 
even more preferably less than about 90% of polypeptides 
with which the polynucleotide is naturally associated. 
Techniques for purifying polynucleotides from bacteria 
are known in the art, and include for example, disruption 
of the bacteria with a chaotropic agent, differential 
extraction and separation of the polynucleotide (s) and 
polypeptides by ion- exchange chromatography, affinity 
chromatography, and sedimentation according to density. 

The term "purified polypeptide" refers to a 
polypeptide or fragment thereof which is essentially 
free, i.e., contains less than about 50%, preferably less 
than about 70%, and even more preferably less than about 
90%, of cellular components with which the polypeptide is 
naturally associated- Techniques for purifying 
polypeptides are known in the art, and examples of these 
techniques are discussed infra. 

"Recombinant host cells", "host cells", 
"cells", "cell lines", "cell cultures", and other such 
terms denoting microorganisms or higher eukaryotic cell 
lines cultured as unicellular entities refer to cells 
which can be, or have been, used as recipients for 
recombinant vector or other transfer DNA, and include the 
progeny of the original cell which has been transfected. 
It is understood that the progeny of a single parental 
cell may not necessarily be completely identical in 
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morphology or in genomic or total DNA complement as the 
original parent, due to natural, accidental, or 
deliberate mutation. 

A "replicon" is any genetic element, e.g., a 
5 plasmid, a chromosome, a virus, a cosmid, etc. that 

behaves as an autonomous unit of polynucleotide replica- 
tion within a cell; i.e., capable of replication under 
its own control . 

A "vector" is a replicon in which another 
10 polynucleotide segment is attached, so as to bring about 
the replication and/or expression of the attached 
segment . 

"Control sequence" refers to polynucleotide 
sequences which are necessary to effect the expression of 

15 coding sequences to which they are ligated. The nature 

of such control sequences differs depending upon the host 
organism; in prokaryotes, such control sequences 
generally include promoter, ribosomal binding site, and 
terminators; in eukaryotes, generally, such control 

20 sequences include promoters, terminators and, in some 
instances, enhancers. The term "control sequences" is 
intended to include, at a minimum, all components whose 
presence is necessary for expression, and may also 
include additional components whose presence is 

25 advantageous, for example, leader sequences. 

"Operably linked" refers to a juxtaposition 
wherein the components so described are in a relationship 
permitting them to function in their intended manner. A 
control sequence "operably linked" to a coding sequence 

30 is ligated in such a way that expression of the coding 

sequence is achieved under conditions compatible with the 
control sequences. 

An "open reading frame" (ORF) is a region of a 
polynucleotide sequence which encodes a polypeptide; this 
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region may represent a portion of a coding sequence or a 
total coding sequence. 

A "coding sequence" is a polynucleotide 
sequence which is transcribed into mRNA and/or translated 
5 into a polypeptide when placed under the control of 

appropriate regulatory sequences. The boundaries of the 
coding sequence are determined by a translation start 
codon at the 5 '-terminus and a translation stop codon at 
the 3' -terminus- A coding sequence can include, but is 
10 not limited to mRNA, cDNA, and recombinant polynucleotide 
sequences . 

"Immunologically identifiable with/as" refers 
to the presence of epitope (s) and polypeptides (s) which 
are also present in the designated polypeptide ( s ) . 
15 Immunological identity may be determined by antibody 

binding and/or competition in binding; these technicgpies 
are known to those of average skill in the art, and are 
also illustrated infra. 

As used herein, "epitope" refers to an 

2 0 antigenic determinant of a polypeptide. An epitope could 

comprise 3 amino acids in a spatial conformation which is 
unique to the epitope. Generally an epitope consists of 
at least 5 such amino acids, and more usually, consists 
of at least 8-10 such amino acids. Methods of 
25 determining the spatial conformation of amino acids are 
known in the art, and include, for exaxtiple, x-ray 
crystallography and 2 -dimensional nuclear magnetic 
resonance . 

A polypeptide is "immunoreactive" when it is 

3 0 "immunologically reactive" with an antibody, i.e., when 

it binds to an antibody due to antibody recognition of a 
specific epitope contained within the polypeptide. 
Immunological reactivity may be determined by antibody 
binding, more particularly by the kinetics of antibody 

35 
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binding, and/or by competition in binding using as 
competitor (s) a known polypeptide (s) containing an 
epitope against which the antibody is directed- The 
techniques for determining whether a polypeptide is 
5 immunologically reactive with an antibody are known in 
the art. An "iramunoreactive" polypeptide may also be 
"immunogenic'*. As used herein, the term "immunogenic 
polypeptide" is a polypeptide that elicits a cellular 
and/or humoral immune response, whether alone or linked 

10 to a carrier in the presence or absence of an adjuvant. 

As used herein, the term "antibody" refers to a 
polypeptide or group of polypeptides which are comprised 
of at least one antibody combining site. An "antibody 
combining site" or "binding domain" is formed from the 

15 folding of variable domains of an antibody molecule (s) to 
form three-dimensional binding spaces with an internal 
surface shape and charge distribution complementary to 
the features of an epitope of an antigen, which allows an 
immunological reaction with the antigen. An antibody 

20 combining site may be formed from a heavy and/or a light 
chain domain (VH and VL, respectively) , which form 
hypervariable loops which contribute to antigen binding. 
The term "antibody" includes, for example, vertebrate 
antibodies, hybrid antibodies, chimeric antibodies, 

25 altered antibodies, univalent antibodies, the Fab 
proteins, and single domain antibodies. 

As used herein, a "single domain antibody" 
(dAb) is an antibody which is comprised of an VH domain, 
which reacts immunologically with a designated antigen. 

30 A dAB does not contain a VL domain, but may contain other 
antigen binding domains known to exist in antibodies, for 
example, the kappa and lambda domains. Methods for 
preparing dABs are known in the art. See, for example. 
Ward et al. (1989) . 
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Antibodies may also be comprised of VH and VL 
domains, as well as other known antigen binding domains. 
Examples of these types of antibodies and methods for 
their preparation are known in the art (see, e,g., U.S* 
5 Patent No. 4,816,467, which is incorporated herein by 
reference) , and include the following. For example, 
"vertebrate antibodies" refers to antibodies which are 
tetramers or aggregates thereof, comprising light and 
heavy chains which are usually aggregated in a "Y" 
10 configuration and which may or may not have covalent 

linkages between the chains. In vertebrate antibodies, 
the amino acid sequences of all the chains of a 
particular antibody are homologous with the chains found 
in one antibody produced by the lymphocyte which produces 
15 that antibody in situ, or in vitro (for example, in 

hybridomas) . Vertebrate antibodies typically include 
native antibodies, for example, purified polyclonal 
antibodies and monoclonal antibodies. Examples of the 
methods for the preparation of these antibodies are 
2 0 described infra . 

"Hybrid antibodies" are antibodies wherein one 
pair of heavy and light chains is homologous to those in 
a first antibody, while the other pair of heavy and light 
chains is homologous to those in a different second anti- 
25 body. Typically, each of these two pairs will bind 

different epitopes, particularly on different antigens. 
This results in the property of "divalence", i.e., the 
ability to bind two antigens simultaneously. Such 
hybrids may also be formed using chimeric chains, as set 
30 forth below • 

"Chimeric antibodies", are antibodies in which 
the heavy and/or light chains are fusion proteins. 
Typically the constant domain of the chains is from one 
particular species and/or class, and the variable domains 
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are from a different species and/or class. Also included 
is any antibody in which either or both of the heavy or 
light chains are composed of combinations of sequences 
mimicking the sequences in antibodies of different 
5 sources, whether these sources be differing classes, or 
different species of origin, and whether or not the 
fusion point is at the variable/ constant boundary. Thus, 
it is possible to produce antibodies in which neither the 
constant nor the variable region mimic known antibody 

10 sequences. It then becomes possible, for example, to 

construct antibodies whose variable region has a higher 
specific affinity for a particular antigen, or whose 
constant region can elicit enhanced complement fixation, 
or to make other improvements in properties possessed by 

15 a particular constant region. 

Another exait^le is "altered antibodies", which 
refers to antibodies in which the naturally occurring 
amino acid sequence in a vertebrate antibody has been 
varied. Utilizing recombinant DNA techniques, antibodies 

20 can be redesigned to obtain desired characteristics. The 
possible variations are many, and range from the changing 
of one or more amino acids to the complete redesign of a 
region, for example, the constant region. Changes in the 
constant region, in general, to attain desired cellular 

25 process characteristics, e.g., changes in complement 

fixation, interaction with membranes, and other effector 
functions. Changes in the variable region may be made to 
alter antigen binding characteristics. The antibody may 
also be engineered to aid the specific delivery of a 

3 0 molecule or substance to a specific cell or tissue site. 
The desired alterations may be made by known techniques 
in molecular biology, e.g., recombinant techniques, site 
directed mutagenesis, etc. 
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Yet another example are "univalent antibodies", 
which are aggregates comprised of a heavy chain/light 
chain dimer bound to the Fc (i.e., constant) region of a 
second heavy chain. This type of antibody escapes 
antigenic modulation. See, e.g., Glennie et al. (1982). 

Included also within the definition of antibod- 
ies are "Fab" fragments of antibodies- The "Fab" region 
refers to those portions of the heavy and light chains 
which are roughly equivalent, or analogous, to the 
sec[uences which comprise the branch portion of the heavy 
and light chains, and which have been shown to exhibit 
immunological binding to a specified antigen, but which 
lack the effector Fc portion . "Fab" includes aggregates 
of one heavy and one light chain (commonly known as 
Fab'), as well as tetramers containing the 2H and 2L 
chains (referred to as F(ab)2)f which are capable of 
selectively reacting with a designated antigen or antigen 
family- "Fab" antibodies may be- divided into sxibsets 
analogous to those described above, i.e, "vertebrate 
Fab", "hybrid Fab", "chimeric Fab", and "altered Fab". 
Methods of producing "Fab" fragments of antibodies are 
known within the art and include, for exan^Jle, 
proteolysis, and synthesis by recombinant techniques. 

The term "polypeptide" refers to a polymer of 
amino acids and does not refer to a specific length of 
the product; thus, peptides, oligopeptides, and proteins 
are included within the definition of polypeptide. This 
term also does not refer to or exclude post -expression 
modifications of the polypeptide, for example, 
glycosylations, acetylations, phosphorylations and the 
like. Included within the definition are, for example, 
polypeptides containing one or more analogs of an amino 
acid (including, for example, unnatural amino acids, 
etc.), polypeptides with substituted linkages, as well as 
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other modifications known in the art, both naturally 
occurring and non-naturally occurring. 

"Transformation", as used herein, refers to the 
insertion of an exogenous polynucleotide into a host 
5 cell, irrespective of the method used for the insertion, 
for example, direct uptake, transduction, f -mating or 
electroporation. The exogenous polynucleotide may be 
maintained as a non- integrated vector, for example, a 
plasmid, or alternatively, may be integrated into the 

10 host genome. 

"Treatment" as used herein refers to 
prophylaxis and/or therapy. 

By "immunogenic" is meant an agent used to 
stimulate the immune system of a living organism, so that 

15 one or more functions of the immune system are increased 
and directed towards the immunogenic agent. Immunogenic 
agents include vaccines . Immunogenic agents can be used 
in the production of antibodies, both isolated polyclonal 
antibodies and monoclonal antibodies, using techniques 

20 known in the art. 

By vaccine is meant an agent used to stimulate 
the immune system of a living organism so that protection 
against or amelioration of future haanti is provided. 
Immunization refers to the process of inducing an 

25 increased level of antibodies and/or cellular immune 

response in which T- lymphocytes respond by killing the 
pathogen and/or activate other cells involved in the 
immune response pathway. The antibodies produced as a 
result of immunization may belong to any of the 

30 immunological classes, such as immunoglobulins A, D, E, G 
or M. 

An "individual", as used herein, refers to 
vertebrates, particularly members of the mammalian 
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species, and includes but is not limited to domestic 
animals, sports animals, and primates, including humans. 

As used herein, the "sense strand" of a nucleic 
acid contains the secjuence that has sequence homology to 
5 that of mRNA. The "anti- sense strand" contains a 

sequence which is complementary to that of the "sense 
strand" • 

As used herein, the term "probe" refers to a 
polynucleotide which forms a hybrid structure with a 

10 sequence in a target region, due to complementarity of at 
least one sequence in the probe with a sequence in the 
target region. The polynucleotide regions of probes may 
be composed of DNA, and/or RNA, and/or synthetic 
nucleotide analogs. 

15 As used herein, the term "target region" refers 

to a region of the nucleic acid which is to be amplified 
and/or detected. The term "target sec[uence refers to a 
sequence with which a probe or primer will form a stable 
hybrid under desired conditions. 

20 The term "primer" as used herein refers to an 

oligomer which is capable of acting as a point of 
initiation of synthesis of a polynucleotide strand when 
placed under appropriate conditions. The primer will be 
completely or substantially conrplementary to a region of 

25 the polynucleotide strand to be copied. Thus, under 

conditions conducive to hybridization, the primer will 
anneal to the complementary region of the analyte strand. 
Upon addition of suitable reactants, (e.g., a polymerase, 
nucleotide triphosphates, and the like) , the primer is 

30 extended by the polymerizing agent to form a copy of the 
analyte strand. The primer may be single- stranded, or 
alternatively may be partially or fully double- stranded. 

The terms "analyte polynucleotide" and "analyte 
strand" refer to a single- or double -stranded nucleic 
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acid molecule which is suspected of containing a target 
sequence, and which may be present in a biological 
sample. 

As used herein, a "biological sample" refers to 
5 a sample of tissue or fluid isolated from an individual, 
including but not limited to, for example, plasma, serum, 
spinal fluid, lymph fluid, the external sections of the 
skin, respiratory, intestinal, and genitourinary tracts, 
tears, saliva, milk, blood cells, tiomors, organs, and 

10 also samples of in vitro cell culture constituents 
(including but not limited to conditioned medium 
resulting from the growth of cells in cell culture 
medium, putatively virally infected cells, recombinant 
cells, and cell components) . 

15 As used herein, the term "oligomer" refers to 

primers and to probes. The term oligomer does not 
connote the size of the molecule. However, typically 
oligomers are no greater than 1000 nucleotides, more 
typically are no greater than 500 nucleotides, even more 

20 typically are no greater than 250 nucleotides; they may 

be no greater than 100 nucleotides, and may be no greater 
than 75 nucleotides, and also may be no greater than 50 
nucleotides in length. 

The term "coupled" as used herein refers to 

25 attachment by covalent bonds or by strong non-covalent 
interactions (e.g., hydrophobic interactions , hydrogen 
bonds, etc.). Covalent bonds may be, for example, ester, 
ether, phosphoester, amide, peptide, imide, carbon- sulfur 
bonds, carbon- phosphorus bonds, and the like. 

30 The term "support" refers to any solid or semi- 

solid surface to which a desired polypeptide or 
polynucleotide may be anchored. Suitable supports 
include glass, plastic, metal, polymer gels, and the 
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like, and may take the form of beads, wells, dipsticks, 
membranes, and the like. 

The term "label" as used herein refers to any 
atom or moiety which can be used to provide a detectable 
5 {preferably quantifiable) signal, and which can be 
attached to a polynucleotide or polypeptide. 

The term "type 4 pilin" as used herein refer to 
pilins that contain a conserved amino terminal 
hydrophobic domain beginning with an amino- terminal 

10 phenylalanine that is methylated upon processing and 

secretion of the pilin. Another characteristic feature 
of type 4 pilins is that in the propilin form they 
contain similar six- or seven-amino acid long leader 
peptides, which are much shorter than typical signal 

15 sequences. Type 4 pilins are expressed by several 
bacterial genuses, including NeisserlsL, Moraxella,, 
BacteroldGS r and Pseudomona^s . Species within these 
genuses which express type 4 pilins are, for example, N. 
gonorrhoeaG , N. meningllzldls , Af. bovls, B. nodosus, and 

20 P. aeirug-inosa. As used herein, the term "type 4 pilin" 
also includes the Tcp pilin of Vibrio, (for example, V. 
cholerae) , that is highly homologous to the type 4 pilins 
of other genuses. Tcp pilin contains the characteristic 
amino -terminal hydrophobic domain as well as having a 

25 modified N- terminal amino acid that in this case may be a 
modified methionine because the Tcp pilin gene encodes a 
methionine residue at the position where all the others 
encode a phenylalanine. Precursor TcpA contains a much 
longer leader sequence than typical type 4 propilins but 

30 retains homology in the region surrounding the processing 
site. 

The term "pilC as used herein refers to a gene 
encoding a polypeptide involved in the assembly of type 4 
pilin, which may also be required for attachment of the 
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pilin, and that is comprised of an epitope that is 
irnmunologically identifiable with an epitope in PilC of 

gonorrhae or meningitidis. Included within this 
term is any homologous region from Vibrio, tcpC. 
5 As used herein the term "PilC" refers to a 

polypeptide encoded within pllC, and includes TcpC of 
ViJbrio, 

The description of the method to retrieve the 
DNA sequences is mostly of historical interest. The 

10 resultant sequences (and their complements) are provided 
herein, and the sequences, or any portion thereof, could 
be prepared using synthetic methods, or by a combination 
of synthetic methods with retrieval of partial sequences 
using methods similar to those described herein. 

15 The description infra, of "walking" the genome 

by isolating overlapping DNA sequences from the 
gonorrheae lambda gt-11 library and from an EMBL3 library 
provides one method by which DNAs corresponding to the 
pile genomes from, inter alia, N. gonorrheae and N. 

20 meningitidis, respectively, may be isolated. However, 

given the information provided herein, other methods for 
isolating pllC DNAs from these species, as well as from 
species of other genuses which have type 4 pilin are 
obvious to one of skill in the art. 

25 Characterization of the genes of the pilC loci 

has provided information on the polypeptides encoded 
therein, and on the control of their expression. 
Even though Type 4 pili have been extensively studied in 
several laboratories, little is known about their 

30 assembly. The presence of a specific assembly machinery 
for this class of pili is evident from the fact that the 
pilin gene of JB. nodosus and M. JdovIs can be properly 
processed and assembled into a pilus in P. ateruginosa. but 
not in E. coli (Ellerman et al., 1986; Mattick et al., 
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1987; Beard et al., 1990). Furthermore, the recent 
genetic characterization of TCP pili of Vibrio cholerae 
has revealed that a number of closely linked genes are 
required for pilin processing and assembly into a 
5 structure (Taylor et al., 1988). The TCP pilin does not 
carry an N-methylphenylalanine but its primary sequence 
is highly homologous to the Type 4 class of pilins- 

The gonorrhoeae pilus facilitates adherence 
of the bacterium to a number of eukaryotic cell types 
10 (Watt et al., 1980) and is thought to play a role in 

bacterial interaction with neutrophils (Fischer and Rest, 
1988) . The pilin is encoded from one or two pilE loci 
(Meyer et al., 1984; Swanson et al., 1986) which most 
likely each form a monocistronic operon. Hence, there 
15 have been no suggestions that genes closely linked to 
pilE are involved in pilus assembly. A dispersed 
location of genes involved in gonococcal pilus assembly 
as well as the rapid occurrence of nonpiliated variants 
generated via recombination with pilin sequences from 
20 silent loci, pilS, have made it extremely difficult to 
identify putative assembly genes for gonococcal pili. 

The Pile protein described herein is a protein 
encoded within a pilC or equivalent (for example, tcpC) 
locus or gene. In N. gonorrhoeas MSll and most other 
25 gonococcal strains the PilC protein is expressed in small 
amounts. It is the only protein that is enriched in 
highly purified preparations of MSll pili. PilC was not 
released from a nonpilated MSll (P"n) variant using the 
same procedure suggesting that this protein interacts 
30 with the polymerized pilus fiber. 

DNA sec[uence analysis of the cloned pilCl gene 
revealed one long open reading frame that was out of 
frame with its putative AUG initiation codon and 5' end 
encoding the signal peptide. Minute amounts of PilC were 
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expressed in E. coli from pABJ04. Gel purified PilC from 
MSll contained a lysine residue in position four, whereas 
pllCl had a glutamine codon at this position. A lysine 
codon was, however, found at position four in a number of 
5 PGR amplified 5' pilC fragments suggesting that these 

fragments represent the 5' end of p±lC2, which then must 
be ON in MSll. The finding that a miniTnCm insertion in 
pilC2 abolished PilC expression, whereas insertional 
inactivation of pllCl did not abolish PilC expression 

10 further argues that pilCl is translationally out of frame 
and pilC2 translationally in frame in the MSll variant we 
are studying. 

PGR amplified fragments of pxlCl and p±lC2 in 
MSll differed in the number of G residues found in the G 

15 tract. Only 11 or 12 Gs were found in pilCl clones 

(which would both generate an OFF phenotype) while 12 or 

13 Gs were found among pilC2 specific clones. Since 
pilC2 is the expressed gene in the MSll variant under 
study, we believe that this variant carries 13 Gs in 

20 p±lC2 and 12 Gs in pllCl. The frequency of frameshift 

mutations in each locus is not known. However, the lack 
of 13 Gs among pllCl specific fragments and the lack of 

14 Gs among pilC2 specific fragments suggests that a 
deletion of one G residue occurs at a higher frequency 

25 than the insertion of one G residue. We had expected to 
find amplified fragments from N. gonorrhoeae containing 
10 G residues in the G tract, but found none in the 48 
clones sequenced. If only one G is added or deleted in 
each mutational event, the frequency of G tracts with 10 

30 residues should be low if G tracts normally are 12 or 13 
bp long. 

Frameshif ting in pilCl also occurred in E. 
coli. In this case, however, two variants with 10 
residues were found out of 12 clones sequenced. It may 
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therefore be that there is a selection against in frame 
variants with 10 Gs in N. g-onorrhoeae . A change from 
five glycines to four in the signal peptide may for 
example have an effect on the physical properties of the 
5 precursor form of PilC such that the signal peptide is 
not cleaved off. E. coll strain AAIO is recA. 
Therefore, frameshift mutations in the G tract of pllC 
occurs independent of the RecA protein. 

Translational f rameshif ting has been shown to 

10 regulate phase and antigenic variation of the gonococcal 
opacity protein PII that is encoded by a nxomber of opa 
loci showing sequence variations. In this system a 
number of pentameric CTCTT repeats are present in the 
region encoding the signal peptide (Stem et al. 1986) . 

15 Variation in the niimber of repeats is independent of recA 
in gonorrhoeae as well as in E, coll (Murphy et al • , 
1989). Variation in the expression of lipopoly saccharide 
epitopes in Haemophilus Influenzae was recently explained 
by translational f rameshif ting created by alterations in 

20 the number of CAAT repeats occurring in the 5' end of 
licA (Weiser et al., 1989). In Bordetella pertussis 
frameshift mutations in the regulatory vir locus occur in 
a run of C residues positioned internally in the gene 
(Stibitz et al,, 1989). The C tract was in this case 

25 varying from 6 (in frame) to 7 residues (out of frame) . 

It is not known if this frameshift mutation is programed 
or not. The pilin gene of Bordetella pertussis was 
recently shown to be preceded by a stretch of Cs. 
Frequent mutations affecting the length of this C tract 

30 influenced the transcriptional activity of the pilin gene 
(Willems et al., 1990). 

Variation in the number of the CTCTT repeats in 
opa genes was recently suggested to be due to 
recombination- independent slipped strand mispairing 
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(Murphy et al*, 1989), Mispairing is thought to occur 
between strands subjected to local denaturation and 
should preferentially occur during replication* A number 
of unusual DNA structures (cruciform, Z form, H form) 
5 have been shown to form in vitro within a variety of 
specific DNA sequences. Under normal conditions the B 
form is the most favorable thermodynamically (Frank- 
Kamenetskii and Vologodskii, 1984) , Transition to 
alternative conformations requires specific external 

10 conditions, supercoiling being the most physiologic. 
Single stranded {ciG)^^ and idC) ^ strands renature more 
slowly than coit5)lementary strands with arbitrary 
sequences, and methylation experiments suggest that a 
poly dG chain may form a hairpin- like structure 

15 stabilized by G-G bp (Panyutin et al . , 1990) . 

Four variant sequences differing outside the G 
tract were obtained by PGR aurplif ication of the 5' end of 
pile from four N. gonorrhoea^e strains. The region 5' of 
the G tract was invariant, as was the 3' end of the 

20 ait^lified region. All variation was confined to a region 
located 3' of the G tract. At least some of these 
sequence variations can be explained by mismatch pairing 
events. Thus, the addition of four nucleotides distal to 
the G tract in variant sequence 4 is possible to explain 

25 by a two step mispairing event occurring within variant 
sequence 2. Slip strand mispairing between the two CA 
residues in -GGCGCAGGCGCA- would yield 
-GGCGCAGGCGCAGGCGCA- . A second mispairing event 
occurring between the two C- residues at positions 3 and 5 

30 gives rise to the sequence -GGCAGGCGCAGGCGCA- present in 
variant 4. It may therefore be that a sequence close to 
a poly(G) tract is prone to slipped strand mispairing. 

Gonococcal pilus. phase variation is associated 
with an altered nucleotide sequence of pilE via 
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recombinations with silent pllS sequences (Haas and 
Meyer, 1986, Swanson et al . , 1986). An irreversible 
switch OFF in pilus expression results from deletions of 
the 5' coding and control regions of the pilE locus 
5 (Swanson et al-, 1985). Reversible gonococcal pilus 

phase variation is associated with nucleotide changes in 
pllE resulting in an altered pilin product. It has been 
suggested that the pilins of these variants are assembly 
defective (Bergstrom et al., 1986; Swanson et al., 1986; 

10 Hill et al., 1990). Here we present evidence that switch 
OFF and ON of PilC expression causes pilus expression to 
phase vary. Five out of five P~, pilin producing 
descendants from MSllj^^^ {P"^, PilC"*") that expressed pilin 
did not express PilC. All tested P"*" revertants from the 

15 five P", PilC" variants had regained expression of PilC. 
The pilin of one nonpiliated PilC OFF- switcher (variant 
8) differed by eight amino acids from that of the parent. 
The fact that one piliated PilC''' backswitcher (8:1) 
expressed a pilin identical in sequence to the 

20 nonpiliated variant (8) strongly suggests that the 

regained expression of pili is due to an ON- switch in 
Pile expression. The above results also inply that the 
nonpiliated phenotype of variant 8 is not due to the 
alterations in the pilin relative to the parental strain 

25 but to an OFF- switch of PilC. The finding that mTnCm 
insertions resulted in P"*" colonies when inserted into 
pllCl and P* colonies when inserted into the actively 
expressing p±lC2 locus offers further evidence that PilC 
is essential for the biogenesis of gonococcal pili- P , 

30 p±lC2i :mTnCm-12 insertion mutants reverted to P"*" colony 
morphology at a low frequency. These revertants most 
likely represent f rameshif ting mutants in pllCl resulting 
in expression of PilC from this locus. A doxable mutant 
in pllCl and pilC2 was stably nonpiliated, expressed 
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pilin, but expressed amounts of pilin that did not 
express any pili when examined by transmission electron 
microscopy. It is therefore believed that out of frame 
mutations of both pilCl and pilC2 will abolish pili 
5 formation. 

At this stage we cannot exclude the possibility 
that some PilC" variants from MSII^^ (P***, PilC" ) are 
generated by transformation of pilCl sequences and 
homologous recombination with pilC2 thus generating 

10 variants with two pi I CI 5' ends at both pllC loci. PilC"*" 
revertants from PilC' clones must, however, all be due to 
frameshift mutations in either pilCl or p±lC2. 

We propose that PilC forms an outer membrane 
pore or assembly center enabling the pilin subunits to be 

15 assembled and translocated across the outer membrane 

analogous to the proposed function of the high molecular 
weight proteins required for the assembly of 
enterobacterial pili (the latter of which is discussed in 
Norgren et al., 1987). Alternatively, PilC may act as an 

20 initiator for polymerization. In the latter case PilC 
would be expected to be located at the tip of the 
polymerized pilus . 

It is possible that the alternate expression of 
PilC from two structurally different pilC loci is yet 

25 another example of antigenic variation in Neisseria 
gonorrhoeae. It is, however, possible that this 
variation could have functional implications as well. 
Each class of E. coli pili utilizes a different outer 
membrane pore/assembly protein. Hence, pilin subunits 

30 and/or periplasmic chaperone complexes may specifically 
interact with an exposed region of the protein allowing 
polymerization of pilus subunit proteins. The repertoire 
of antigenic variants of gonococcal pilins is vast 
(Hagblom et al,, 1985) . It may be that only certain 
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pilin variants are assembled via PilCl and PilC2 
respectively. Alternatively, if PilC acts as an 
initiator it could also possess other properties such as 
being involved in Pilus mediated attachment. 
5 In one embodiment of the invention, 

immimogenically active polypeptides encoded within pilC 
are prepared. The availability of p±lC DNA sequences, 
either those isolated by utilizing the DNA secjuences 
described in the Examples, or nucleotide sequences 

10 derived therefrom (including segments and modifications 
of the sequence) , permits the construction of expression 
vectors encoding immunologically reactive regions of the 
polypeptide encoded in either strand. Immunological 
reactivity may be determined by immunoassay using 

15 antibodies raised to PilC. Fragments encoding the 

desired polypeptides are derived from the DNA clones 
using conventional restriction digestion or by synthetic 
methods, and are ligated into vectors which may, for 
exait^jle, contain portions of fusion sequences such as 

20 beta-galactosidase or superoxide dismutase (SOD) , prefer- 
ably SOD. Methods and vectors which are useful for the 
production of polypeptides which contain fusion sequences 
of SOD are described in European Patent Office 
Publication number 0196056, published October 1, 1986. 

25 Any desired portion of the p±lC DNA containing an open 

reading frame, in either sense strand, can be obtained as 
a recombinant polypeptide, such as a mature or fusion 
protein; alternatively, a polypeptide encoded in the DNA 
can be provided by chemical synthesis. 

30 The DNA encoding the desired polypeptide, 

whether in fused or mature form, and whether or not 
containing a signal sequence to permit secretion, may be 
ligated into expression vectors suitable for any 
convenient host. Both eukaryotic and prokaryotic host 
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systeins are presently used in forming recombinant 
polypeptides, and a summary of some of the more common 
control systems and host cell lines is given infra. The 
polypeptide is then isolated from lysed cells or from the 
5 culture medium and purified to the extent needed for its 
intended use. Purification may be by techniques known in 
the art, for example, differential extraction, salt 
fractionation, chromatography on ion exchange resins, 
affinity chromatography, centrifugation, and the like. 

10 See, for example. Methods in Enzymology for a variety of 
methods for purifying proteins. Such polypeptides can be 
used as diagnostics, or those which give rise to 
neutralizing antibodies may be formulated into vaccines. 
Antibodies raised against these polypeptides can also be 

15 used as diagnostics, or for passive immunotherapy. 

The Pile antigens may also be isolated from 
meningococci and from gonococci. The bacteria may be 
grown by conditions known in the art, some of which are 
described infra. In addition, a method for isolating , 

20 Pile from gonococci is described infra. 

In another embodiment of the invention, the 
immunoreactive polypeptides may be conjugated with 
carrier. An antigenic region of a polypeptide is gener- 
ally relatively small- -typically 8 to 10 amino acids or 

25 less in length. Fragments of as few as 5 amino acids may 
characterize an antigenic region. These segments may 
correspond to regions of PilC antigen. Accordingly, 
using the DNAs of pxlC as a basis, DNAs encoding short 
segments of PilC polypeptides can be expressed 

30 recombinantly either as fusion proteins, or as isolated 
polypeptides. In addition, short amino acid sequences 
can be conveniently obtained by chemical synthesis. In 
instances wherein the synthesized polypeptide is 
correctly configured so as to provide the correct 
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epitope, but is too small to be immunogenic, the 
polypeptide may be linked to a suitable carrier. 

A number of techniques for obtaining such link- 
age are known in the art, including the formation of 
5 disulfide linkages using 

N-succinimidyl-3- (2-pyridyl-thio) propionate (SPDP) and 
succinimidyl 

4- (N-maleimido -methyl) cyclohexane-l-carboxylate (SMCC) 
obtained from Pierce Corr^any, Rockford, Illinois, (if the 

10 peptide lacks a sulfhydryl group, this can be provided by 
addition of a cysteine residue.) These reagents create a 
disulfide linkage between themselves and peptide cysteine 
residues on one protein and an amide linkage through the 
epsilon-amino on a lysine, or other free amino group in 

15 the other. A variety of such disulfide/amide- forming 

agents are known. See, for example, Immun. Rev. (1982) 
62:185. Other bifunctional coupling agents form a 
thioether rather than a disulfide linkage. Many of these 
thio- ether- forming agents are commercially available and 

20 include reactive esters of 6-maleimidocaproic acid, 
2-bromoacetic acid, 2-iodoacetic acid, 

4- (N-maleimido-methyl) cyclohexane-l-carboxylic acid, and 

the like. The carboxyl groups can be activated by 

combining them with succinimide or 
25 l-hydroxyl-2-nitro-4-sulf onic acid, sodium salt. 

Additional methods of coupling antigens employs the 

rotavirus/ "binding peptide" system described in EPO Pub. 

No- 259,149, the disclosure of which is incorporated 

herein by reference. The foregoing list is not meant to 
30 be exhaustive, and modifications of the named confounds 

can clearly be used. 

Any carrier may be used which does not itself 

induce the production of antibodies harmful to the host. 

Suitable carriers are typically large, slowly metabolized 
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macromolecules such as proteins; polysaccharides, such as 
latex functionalized sepharose, agarose, cellulose, cel- 
lulose beads and the like; polymeric amino acids, such as 
polyglutamic acid, polylysine, and the like; amino acid 
5 copolymers; and inactive virus particles or attenuated 
bacteria of other strains, for example, those of 
Salmonella, Especially useful protein substrates are 
serum albumins, keyhole limpet hemocyanin, immunoglobulin 
molecules, thyroglobulin, ovalbiamin, tetanus toxoid, and 

10 other proteins well known to those skilled in the art. 

In addition to full-length PilC proteins, 
polypeptides comprising truncated PilC amino acid 
sequences encoding at least one immunologically reactive 
epitope are useful immunological reagents. For example, 

15 polypeptides comprising such truncated sequences can be 
used as reagents in an immunoassay. These polypeptides 
also are candidate subunit antigens in compositions for 
antiserum production or vaccines. While these truncated 
sequences can be produced by various known treatments of 

20 native bacterial protein, it is generally preferred to 
make synthetic or recombinant polypeptides comprising a 
PilC sequence. Polypeptides comprising these truncated 
PilC sequences can be made up entirely of PilC sequences 
(one or more epitopes, either contiguous or 

25 noncontiguous) , or PilC sequences and heterologous 
sequences in a fusion protein. Useful heterologous 
sequences include sequences that provide for secretion 
from a recombinant host, enhance the immunological 
reactivity of the PilC epitope (s), or facilitate the 

3 0 coupling of the polypeptide to an immunoassay support or 
a vaccine carrier. See, e.g., EPO Pub. No. 116,201; 
U.S. Pat. No, 4,722,840; EPO Pub. No. 259,149; U.S. 
Pat. No. 4,629,783, the disclosures of which are 
incorporated herein by reference. 
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The size of polypeptides coit^^rising the 
truncated PilC sequences can vary widely, the minimum 
size being a sequence of sufficient size to provide an 
immunologically reactive PilC epitope, while the maximum 
5 size is not critical. For convenience, the maximum size 
usually is not substantially greater than that required 
to provide the desired PilC epitopes and function (s) of 
the heterologous sequence, if any. Typically, the 
truncated PilC amino acid sequence will range from about 

10 5 to about 100 amino acids in length. More typically, 

however, the PilC sequence will be a maximum of about 50 
amino acids in length, preferably a maximum of about 3 0 
amino acids. It is usually desirable to select PilC 
sequences of at least about 10, 12 or 15 amino acids, up 

15 to a maximum of about 20 or 25 amino acids. 

Truncated PilC amino acid sequences comprising 
epitopes can be identified in a number of ways. For 
example, the entire PilC protein sequence can be screened 
by preparing a series of short peptides that together 

20 span the entire protein sequence. By starting with, for 
exanple, lOOmer polypeptides, it would be routine to test 
each polypeptide for the presence of epitope (s) showing a 
desired reactivity, and then testing progressively 
smaller and overlapping fragments from an identified 

25 lOOmer to map the epitope of interest. Screening such 
peptides in an immunoassay is within the skill of the 
art. It is also known to carry out a computer analysis 
of a protein sequence to identify potential epitopes, and 
then prepare oligopeptides comprising the identified 

30 regions for screening. 

In another embodiment of the invention, the 
immunogenicity of the epitopes of PilC may also be 
enhanced by preparing them in mammalian or yeast systems 
fused with or assembled with particle- forming proteins 
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such as, for example, that associated with hepatitis B 
surface antigen. See, e.g., U.S. Pat, No. 4,722,840. 
Constructs wherein the PilC epitope is linked directly to 
the particle- forming protein coding sequences produce 
5 hybrids which are immunogenic with respect to the PilC 
epitope. In addition, all of the vectors prepared 
include epitopes specific to HBV, having various degrees 
of immunogenicity, such as, for example, the pre-S 
peptide. Thus, particles constructed from particle 

10 forming protein which include PilC sequences are 

immunogenic with respect to the microorganism encoding 
the PilC epitope (for example. Neisseria.^ Vibrio, 
MoraxellsL, Bacteroides , or Pseudomonas) and HBV. 

Hepatitis surface antigen (HBSAg) has been 

15 shown to be formed and assembled into particles in S. 

cerevisiae (Valenzuela et al. (1982)), as well as in, for 
example, mammalian cells (Valenzuela, P., et al. (1984)). 
The formation of such particles has been shown to enhance 
the immunogenicity of the monomer subunit. The 

20 constructs may also include the immunodominant epitope of 
HBSAg, comprising the 55 amino acids of the presurface 
(pre-S) region. Neurath et al. (1984). Constructs of 
the pre-S -HBSAg particle expressible in yeast are 
disclosed in EPO 174,444, published March 19, 1986; 

25 hybrids including heterologous viral sequences for yeast 
expression are disclosed in EPO 175,261, pxxblished March 
26, 19 66. These constructs may also be expressed in 
mammalian cells such as Chinese hamster ovary (CHO) cells 
using an SV40-dihydrof olate reductase vector (Michelle et 

30 al. (1984) ) , 

In addition, portions of the particle- forming 
protein coding sequence may be replaced with codons 
encoding a PilC epitope. In this replacement, regions 
which are not required to mediate the aggregation of the 
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units to form immunogenic particles in yeast or mammals 
can be deleted, thus eliminating additional HBV antigenic 
sites from competition with the PilC epitope. 

In another embodiment of the invention, the 
5 iramunoreactive polypeptides encoded in pilC are prepared 
into vaccines. Vaccines may be prepared from one or more 
immunogenic polypeptides derived from pllC* If 
recombinant, these polypeptides may be expressed in 
various host cells {e.g., bacteria, yeast, insect, or 

10 mammalian cells) , or alternatively may be isolated from 

the bacterial preparations- In addition to the above, it 
is also possible to prepare live vaccines of attenuated 
microorganisms which express one or more recombinant 
polypeptides derived from the pilC gene. Suitable 

15 attenuated microorganisms are known in the art and 

include, for example, viruses (e.g., vaccinia virus (see 
Brown et al. (1986)), as well as bacteria. 

The preparation of vaccines which contain an 
immunogenic polypeptide (s) as active ingredients, is 

20 known to one skilled in the art. Typically, such 

vaccines are prepared as injectables, either as liquid 
solutions or suspensions; solid forms suitable for solu- 
tion in, or suspension in, liquid prior to injection may 
also be prepared. The preparation may also be 

25 emulsified, or the protein encapsulated in liposomes. 

The active immunogenic ingredients are often mixed with 
excipients which are pharmaceutical ly acceptable and 
compatible with the active ingredient. Suitable 
excipients are, for example, water, saline, dextrose, 

30 glycerol, ethanol, or the like and combinations thereof. 
In addition, if desired, the vaccine may contain minor 
amounts of auxiliary substances such as wetting or 
emulsifying agents, pH buffering agents, and/or adjuvants 
which enhance the effectiveness of the vaccine. Examples 
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of adjuvants which may be effective include but are not 
limited to: aluminum hydroxide, N-acetyl-muramyl- 
L- threonyl-D-isoglutamine (thr-MDP) , N-acetyl -nor- 
muramyl-L-alanyl-D-isoglutamine (CGP 11637, referred to 
5 as nor-MDP) , N-acetylmuramyl-L-alanyl-D- isoglutaminyl- 
L-alanine-2- {1' -2' -dipalmitoyl-sn-glycero-3-hydroxyphosph 
oryloxy) -ethylamine (CGP 19835A, referred to as MTP-PE) , 
and RIBI, which contains three components extracted from 
bacteria, monophosphoryl lipid A, trehalose dimycolate 

10 and cell wall skeleton (MPL+TDM+CWS) in a 2% 

squalene/Tween 80 emulsion. The effectiveness of an 
adjuvant may be determined by measuring the amount of 
antibodies directed against an immunogenic polypeptide 
containing a PilC immunoreactive sequence resulting from 

15 administration of this polypeptide in vaccines which are 
also comprised of the various adjuvants. 

The vaccines are conventionally administered 
parenterally, by injection, for example, either 
subcutaneous ly or intramuscularly- Additional 

20 formulations which are suitable for other modes of 

administration include suppositories and, in some cases, 
oral formulations. For suppositories, traditional 
binders and carriers may include, for example, 
polyalkylene glycols or triglycerides; such suppositories 

25 may be formed from mixtures containing the active 

ingredient in the range of 0.5% to 10%, preferably l%-2%. 
Oral formulations include such normally employed 
excipients as, for example, pharmaceutical grades of 
mannitol, lactose, starch, magnesium stearate, sodiiim 

30 saccharine, cellulose, magnesium carbonate, and the like. 
These compositions take the form of solutions, suspen- 
sions, tablets, pills, capsules, sustained release 
formulations or powders and contain 10% -95% of active 
ingredient, preferably 25%- 70%. 
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The proteins may be formulated into the vaccine 
as neutral or salt forms. Pharmaceutically acceptable 
salts include the acid addition salts (formed with free 
amino groups of the peptide) and which are formed with 
5 inorganic acids such as, for exaitple, hydrochloric or 
phosphoric acids, or such organic acids such as acetic, 
oxalic, tartaric, maleic, and the like. Salts formed 
with the free carboxyl groups may also be derived from 
inorganic bases such as, for example, sodium, potassium, 

10 ammonium, calcixim, or ferric hydroxides, and such organic 
bases as isopropylamine, trimethylamine , 2-ethylamino 
ethanol, histidine, procaine, and the like. 

The vaccines are administered in a manner 
compatible with the dosage formulation, and in such 

15 amount as will be prophylactically and/or therapeutically 
effective. The quantity to be administered, which is 
generally in the range of 5 micrograms to 250 micrograms 
of antigen per dose, depends on the subject to be 
treated, capacity of the subject's immune system to 

20 synthesize antibodies, and the degree of protection 

desired. Precise amounts of active ingredient required 
to be administered may depend on the judgment of the 
practitioner and may be peculiar to each siibject. 

The vaccine may be given in a single dose 

25 schedule, or preferably in a multiple dose schedule. A 
multiple dose schedule is one in which a primary course 
of vaccination may be with 1-10 separate doses, followed 
by other doses given at subsequent time intervals 
required to maintain and or reenforce the immune 

3 0 response, for example, at 1-4 months for a second dose, 

and if needed, a subsequent dose(s) after several months. 
The dosage regimen will also, at least in part, be 
determined by the need of the individual and be dependent 
upon the judgment of the practitioner. 
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In addition, the vaccine containing the im- 
munogenic antigen (s) derived from pilC may be 
administered in conjunction with other iramunoregulatory 
agents, for example, immune globulins. 
5 Another embodiment of the invention are 

antibodies which react immunologically with PilC 
epitopes. The immunogenic polypeptides prepared as 
described above are used to produce antibodies, including 
polyclonal and monoclonal. If polyclonal antibodies are 

10 desired, a selected mammal (e.g., mouse, rabbit, goat, 

horse, etc.) is immunized with an immunogenic polypeptide 
bearing a PilC epitope (s) . Serxim from the immunized 
animal is collected and treated according to known 
procedures. If serxim containing polyclonal antibodies to 

15 a PilC epitope (i.e., an epitope encoded within pilC) 
contains antibodies to other antigens, the polyclonal 
antibodies can be purified by immunoaf f inity 
chromatography. Techniques for producing and processing 
polyclonal antisera are known in the art, see for 

20 example, Mayer and Walker (1987) . 

Monoclonal antibodies directed against PilC 
epitopes can also be readily produced by one skilled in 
the art. The general methodology for making monoclonal 
antibodies by hybridomas is well known. Immortal 

25 antibody -producing cell lines can be created by cell 
fusion, and also by other techniques such as direct 
transformation of B lymphocytes with oncogenic DNA, or 
transfection with Epstein-Barr virus. See, e.g., M. 
Schreier et al. (1980); Hammerling et al . (1981); Kennett 

30 et al, (1980); see also , U.S. Patent Nos . 4,341,761; 

4,399,121; 4,427,783; 4,444,887; 4,466,917; 4,472,500; 
4,491,632; and 4,493,890. Panels of monoclonal 
antibodies produced against PilC epitopes can be screened 
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for various properties; i.e., for isotype, epitope affin- 
ity, etc. 

Antibodies, both monoclonal and polyclonal, 
which are directed against PilC epitopes are particularly 
5 useful in diagnosis, and those which are neutralizing are 
useful in passive immunotherapy. Methods for introducing 
antibodies into an individual to accomplish passive 
immunotherapy are known in the art. In addition, 
monoclonal antibodies, in particular, may be used to 

10 raise anti-idiotype antibodies. 

Anti-idiotype antibodies are immunoglobulins 
which carry an "internal image" of the antigen of the 
infectious agent against which protection is desired. 
See, for example, Nisonoff, A., et al. (1981) and 

15 Dreesman et al. (1985). Techniques for raising 

anti-idiotype antibodies are known in the art. See, for 
example, Grzych (1985), MacNamara et al, (1984), and 
Uytdehaag et al, (1985) . These anti-idiotype antibodies 
may also be useful for treatment, vaccination and/or 

20 diagnosis of the relevant microorganism encoding the 
antigen of interest, (for example. Neisseria, 
Psendomonas , Moraxella, Bacteroides, or Vijbrio) as well 
as for an elucidation of the immunogenic regions of PilC. 

Another embodiment of the invention concerns 

25 immunoassays and diagnostic kits. The polypeptides which 
contain epitopes encoded in pllC which are immunoreactive 
with anti-PilC antibodies in biological samples are 
useful in immunoassays to detect presence of anti-PilC 
antibodies, or the presence of the relevant microorganism 

3 0 or its antigens in biological samples. Design of the 

immunoassays is subject to a great deal of variation, and 
many formats are known in the art. The immunoassay will 
utilize a polypeptide comprised of at least one epitope 
derived from PilC or encoded in pilC. In one embodiment, 
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the immunoassay 'uses a combination of epitopes including 
the one derived from PilC or encoded in pilC. These 
epitopes may be derived from the same or from different 
polypeptides, and may be in separate recombinant or 
5 natural polypeptides, or together in the same recombinant 
polypeptides. An immunoassay may use, for example, a 
monoclonal antibody directed towards an epitope (s), a 
combination of monoclonal antibodies directed towards 
epitopes of one antigen, monoclonal antibodies directed 

10 towards epitopes of different antigens, polyclonal 
antibodies directed towards the same antigen, or 
polyclonal antibodies directed towards different 
antigens. Protocols may be based, for example, upon 
competition, or direct reaction, or sandwich type assays, 

15 Protocols may also, for example, use solid supports, or 
may be by immunoprecipitation. Most assays involve the 
use of labeled antibody or polypeptide; the labels may 
be, for example, enzymatic, fluorescent, 

chemiluminescent , radioactive, or dye molecules. Assays 
20 which amplify the signals from the probe are also known; 
examples of which are assays which utilize biotin and 
avidin, and enzyme- labeled and mediated immunoassays, 
such as ELISA assays • 

Typically, an immunoassay for an anti-PilC 
25 antibody (s) will involve selecting and preparing the test 
sample suspected of containing the antibodies, such as a 
biological sample, then incubating it with an 
immunoreactive (also called antigenic) polypeptide (s) 
containing at least one epitope encoded in pilC. The 
30 incubation is under conditions that allow 

antigen- antibody complexes to form. Suitable incubation 
conditions are well known in the art. Subsequent to the 
incubation, complexes which are formed which contain the 
immunoreactive polypeptide are detected. The immunoassay 
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may be, without limitations, in a heterogenous or in a 
homogeneous format, and of a standard or competitive 
type. 

In a heterogeneous format, the polypeptide is 
5 typically bound to a solid support to facilitate separa- 
tion of the sample from the polypeptide after incubation. 
Examples of solid supports that can be used are nitro- 
cellulose (e.g., in membrane or microtiter well form), 
polyvinyl chloride (e.g., in sheets or microtiter wells), 
10 polystyrene latex (e.g., in beads or microtiter plates, 
polyvinyl idine fluoride (known as Immulon) , diazotized 
paper, nylon mert±)ranes, activated beads, and Protein A 

1 -I 2 

beads. For example, Dynatech Immulon or Immulon 

microtiter plates or 0.25 inch polystyrene beads (Preci- 
15 sion Plastic Ball) can be used in the heterogeneous 
format. The solid support containing the antigenic 
polypeptide is typically washed after separating it from 
the test sample, and prior to detection of bound antibod- 
ies. Both standard and competitive formats are known in 
20 the art- 

In a homogeneous format, the test sample is 
incubated with antigen in solution. For example, it may 
be under conditions that will precipitate any 
antigen- antibody complexes which are formed. Both 

25 standard and coit^etitive formats for these assays are 
known in the art. 

In a standard format, the amount of anti-PilC 
antibodies forming the antibody- antigen complex is 
directly monitored. This may be accomplished by 

30 determining whether labeled ant i- xenogeneic (e.g., 

anti -human) antibodies which recognize an epitope on 
anti-PilC antibodies will bind due to complex formation . 
In a competitive format, the amoimt of anti-PilC 
antibodies in the sample is deduced by monitoring the 
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competitive effect on the binding of a known amount of 
labeled antibody (or other competing ligand) in the 
complex. 

Complexes formed comprising anti-PilC antibody 
5 (or, in the case of competitive assays, the amount of 
competing antibody) are detected by any of a number of 
known techniques, depending on the format. For example, 
unlabeled anti-PilC antibodies in the complex may be 
detected using a conjugate of antixenogeneic Ig complexed 

10 with a label, (e.g., an enzyme label). 

In immunoassays where PilC polypeptides are the 
analyte, the test sample, which may be a biological 
sample, is incubated with anti-PilC antibodies under 
conditions that allow the formation of antigen -antibody 

15 complexes. Various formats can be employed. For 
example, a "sandwich assay" may be employed, where 
antibody bound to a solid support is incubated with the 
test sample; washed; incubated with a second, labeled 
antibody to the analyte, and the support is washed again. 

20 Analyte is detected by determining if the second antibody 
is bound to the support. In a conpetitive format, which 
can be either heterogeneous or homogeneous, a test sample 
is usually incubated with antibody and a labeled, 
competing antigen is also incubated, either sequentially 

25 or simultaneously. These and other formats are well 
known in the art. 

The antigenic regions of the polypeptides 
encoded in pilC can be mapped and identified by screening 
the antigenicity of expression products of pllC DNAs 

30 which encode portions of the PilC, The expression 

products may be from a variety of expression systems, 
including, for example bacterial systems, yeast systems, 
insect systems, and eukaryotic cell systems. In 
addition, studies giving rise to an antigenicity index 
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and hydrophobicity/hydrophilicity profile give rise to 
information concerning the probability of a region's 
antigenicity . 

Efficient detection systems for infection with 
pathogenic microorganisms, (for example, Neisseria, 
Pseudomonas , Bacteroldes , Moxraxella, or Vibrio) may 
include the use of panels of epitopes. The epitopes in 
the panel may be constructed into one or multiple 
polypeptides. At least one of the epitopes will be 
encoded in pllC or derived from PilC, The assays for the 
varying epitopes may be sequential or simultaneous. 

Kits suitable for immunodiagnosis and contain- 
ing the appropriate labeled reagents are constructed by 
packaging the appropriate materials, including the 
polypeptides of the invention containing PilC epitopes 
(i.e., epitopes encoded within pllC) or antibodies 
directed against PilC epitopes in suitable containers. 
The kit may also contain other reagents^ for example, 
buffer and standard, as well as other materials required 
for the conduct of the assay, as well as a suitable set 
of instructions for conducting the assay using the kit 
materials . 

Another embodiment of the invention are 
oligomers. Using the disclosed portions of the pllC DNAs 
as a basis, oligomers of approximately 8 nucleotides or 
more can be prepared, either by excision from recombinant 
polynucleotides or by synthetic methods which are known 
in the art. These oligomers can serve as probes for the 
detection (including isolation and/or labeling) of 
polynucleotides which contain pllC sequences, and/or as 
primers for the transcription and/or replication of 
targeted pllC sequences , The oligomers contain a 
targeting polynucleotide sec[uence, which is comprised of 
nucleotides which are complementary to a target pllC 
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nucleotide sequence; the sequence is of sufficient length 
and complementarity with the pllC sequence to form a 
duplex which has sufficient stability for the purpose 
intended. For example, if the purpose is the isolation, 
5 via immobilization, of an analyte containing a target 
pile sequence, the oligomers would contain a 
polynucleotide region which is of sufficient length and 
complementarity to the targeted p±lC sequence to afford 
sufficient duplex stability to immobilize the analyte on 

10 a solid surface, via its binding to the oligomers, under 
the isolation conditions* For example, also, if the 
oligomers are to serve as primers for the transcription 
and/or replication of target pilC sequences in an analyte 
polynucleotide, the oligomers would contain a 

15 polynucleotide region of sufficient length and 

complementarity to a region flanking the targeted pilC 
sequence to allow the polymerizing agent to continue 
replication from the primers which are in stable duplex 
form with the target sequence, under the polymerizing 

2 0 conditions. The oligomers may contain a minimum of about 

4 contiguous nucleotides which are complementary to a 
targeted pilC sequence; usually the oligomers will 
contain a minimtam of about 8 contiguous nucleotides which 
are complementary to the targeted pilC sequence, and 
25 preferably will contain a minimum of about 14 contiguous 
nucleotides which are complementary to the targeted pilC 
sequence . 

The oligomer, however, need not consist only of 
the sequence which is complementary to the targeted pilC 

3 0 sequence. It may contain in addition, nucleotide 

sequences or other moieties which are suitable for the 
purposes for which the oligomers are used. For example, 
if the oligomers are used as primers for the 
amplification of targeted pllC sequences via the 
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polymerase chain reaction (PGR) , they may contain 
sequences which, when in duplex, form restriction enzyme 
sites which facilitate the cloning of the amplified 
sequences. Other types of moieties or seqpiences which 
5 are useful of which the oligomers may be comprised or 
coupled to, are those which are known in the art to be 
suitable for a variety of purposes, including the 
labeling of nucleotide probes* 

In the basic nucleic acid hybridization assay, 

10 single-stranded analyte nucleic acid (either DNA or RNA) 
is hybridized to a nucleic acid probe, and resulting 
duplexes are detected- The probes for pllC sequences 
(natural or derived) are a length which allows the 
detection of these sequences by hybridization. While 

15 6-8 nucleotides may be a workable length, sequences of 

10-12 nucleotides are preferred, and about 20 nucleotides 
or more appears optimal. Preferably, these sequences 
will derive from regions which lack heterogeneity. These 
probes can be prepared using routine methods, including 

20 automated oligonucleotide synthetic methods. For use as 
probes, complete conqplementarity is desirable, although 
it may be unnecessary as the length of the fragment is 
increased. 

For use of such probes as agents to detect the 
25 presence of pllC sequences, the sample to be analyzed 

(which may be biological) may be treated, if desired, to 
extract the nucleic acids contained therein. The 
resulting nucleic acid from the sample may be subjected 
to gel electrophoresis or other size separation 
30 techniques; alternatively, the nucleic acid sample may be 
dot blotted without size separation. In order to form 
hybrid duplexes with the targeting sequence of the probe, 
the targeted region of the analyte nucleic acid must be 
in single- stranded form. The latter may occur naturally; 
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altematively, it may be accomplished by denaturation. 
Denaturation can be accomplished by various techniques 
known in the art. Subsequent to denaturation, the 
analyte nucleic acid and probe are incubated under 
5 conditions which promote stable hybrid formation of the 
target sequence in the probe with the putative targeted 
sequence in the analyte, and the resulting duplexes 
containing probe (s) are detected. 

Detection of the resulting duplex, if any, is 

10 usually accomplished by the use of labeled probes; 

alternatively, the probe may be unlabeled, but may be 
detectable by specific binding with a ligand which is 
labeled, either directly or indirectly. Suitable labels, 
and methods for labeling probes and ligands are known in 

15 the art, and include, for example, radioactive labels 
which may be incorporated by known methods (e.g., nick 
translation or kinasing) , biotin, fluorescent groups, 
chemiluminescent groups (e.g., doxetanes, particularly 
triggered dioxetands) , enzymes, antibodies, and the like. 

20 Variations of this basic scheme are known in the art. 

If the targeted pilC sequences are expected to 
be present at relatively low levels, air^lif ication may be 
required for their detection. Such techniques are known 
in the art. For example, the Enzo Biochemical 

25 Corporation "Bio-Bridge" system uses terminal 
deoxynucleotide transferase to add unmodified 
3' -poly- dT- tails to a DNA probe. The poly dT-tailed 
probe is hybridized to the target nucleotide sequence, 
and then to a biotin-modif ied poly-A. PCT application 

30 84/03520 and EPA124221 describe a DNA hybridization 
assay in which: (1) analyte is annealed to a 
single- stranded DNA probe that is complementary to an 
enzyme- labeled oligonucleotide; and (2) the resulting 
tailed duplex is hybridized to an enzyme -labeled 
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oligonucleotide. EPA 204510 describes a DNA 
hybridization assay in which analyte DNA is contacted 
with a probe that has a tail, such as a poly-dT tail, an 
amplifier strand that has a sequence that hybridizes to 
5 the tail of the probe, such as a poly- A sequence, and 
which is capable of binding a plurality of labeled 
strands. A particularly desirable technique may first 
involve amplification of the target pilC sequences. The 
target pilC sequences in sera may be amplified, for 

10 example, to approximately 10^ sequences/ml. This may be 
accomplished, for example, by the polymerase chain 
reactions (PGR) technique described by Saiki et al. 
(1986), by Mullis, U.S. Patent No. 4,683,195, and by 
Mullis et al. U.S. Patent No. 4,683,202. Amplification 

15 may be prior to, or preferably subsequent to purification 
of the pile target sequence. For exait^le, amplification 
may be utilized in conjunction with the assay methods 
described in U.S. Patent No. 4,868,105, or if even 
further amplification is desired, in conjunction with the 

20 hybridization system in EPO Publication No. 317,077. 

Generally, in the PGR technique, short 
oligonucleotide primers are prepared which match opposite 
ends of a desired sequence. The sequence between the 
primers need not be known. A sample of polynucleotide is 

25 extracted and denatured, preferably by heat, and 

hybridized with oligomers which are oligonucleotide 
primers, which are present in molar excess. 
Polymerization is catalyzed by a template- and primer - 
dependent polymerase in the presence of deoxynucleotide 

30 triphosphates (dNTPs) , and may also be in the presence of 
nucleotide analogs. This results in two "long products" 
which contain the respective primers at their 5' -termini, 
covalently linked to the newly synthesized complements of 
the original strands. The replicated DNA is again 
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denatured, hybridized with oligonucleotide primers, 
returned to polymerizing conditions, and a second cycle 
of replication is initiated. The second cycle provides 
the two original strands, the two long products from 
5 cycle 1, and two "short products" replicated from the 
long products. The short products contain sequences 
(sense or antisense) derived from the target sequence, 
flanked at the 5'- and 3' -termini with primer sequences. 
On each additional cycle, the niomber of short products is 

10 replicated exponentially. Thus, this process causes 
amplification of a specific target sequence. 

It will be understood that "primer", as used 
herein, may refer to more than one primer, particularly 
in the case where there is some ambiguity in the 

15 information regarding the terminal sequence (s) of the 
target region to be amplified. Hence, a "primer" 
includes a collection of primer oligonucleotides 
containing sequences representing the possible variations 
in the sequence or includes nucleotides which allow a 

20 typical base pairing. One of the primer oligomers in 
this collection will be homologous with the end of the 
target sequence. 

The amplified sequence (s) may then be detected 
using a hybridization assay which utilize nucleic acid 

25 multimers which bind to single- stranded analyte nucleic 
acid, and which also bind to a multiplicity of 
single- stranded labeled oligonucleotides. A suitable 
solution phase sandwich assay which may be used with 
labeled polynucleotide probes, and the methods for the 

30 preparation of probes is described in EPO 225,807, 
published June 16, 1987. 

The probes can be packaged into diagnostic 
kits. Diagnostic kits include the probe DNA, which, may 
be labeled; alternatively, the probe DNA may be unlabeled 
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and the ingredients for labeling may be included in the 
kit in separate containers. The kit may also contain 
other suitably packaged reagents and materials needed for 
the particular hybridization protocol, for example, 
5 standards, buffers, as well as instructions for 
conducting the test using the kit ingredients. 

The pile DNA secpience information in the clones 
described in the Examples may be used to gain further 
information on the remaining sequence of the pllC gene 
10 from meningococci, for other possible alleles of pilC in 
Neisseria r as well as pllC in other relevant genuses and 
species. This information will aid in the 
characterization of the gene, and of its role in 
virulence of the pathogenic forms of microorganisms, 
15 including, for exaii?)le. Neisseria, Pseudomonas , 

Bacteroldes , Moraxella, and Vibrio. Moreover, this 
sequence information can lead to additional 
polynucleotide probes, polypeptides derived from pllC, 
multiple pile loci, and antibodies directed against PilC 
20 epitopes which would be useful for the diagnosis and/or 

treatment of infections caused by the relevant pathogenic 
microorganisms . 

The DNA sequence information in the 
above-mentioned clones is useful for the design of probes 
25 for the isolation of additional DNA sequences which are 
derived from as yet undefined regions of plIC. For 
example, labeled probes containing a sequence of 
approximately 8 or more nucleotides, and preferably 20 or 
more nucleotides, which are derived from regions close to 
30 the ends of the DNA sequences shown in the Exaxxtples. 
These probes may be used to isolate overlapping DNA 
sec[uences within or adjacent to pllC from DNA libraries 
created from genomes of species having type 4 pilins . 
The resulting overlapping DNAs may then be used to 
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.synthesize probes for identification of other overlapping 
fragments which do not necessarily overlap the DNAs whose 
sequences are given in the Examples. Thus, it is 
possible to sequence entire pilC genes utilizing the DNA 
5 sequences provided herein and the technique of isolation 
of overlapping DNAs derived from the pilC genes. 

Methods for constructing DNA libraries are 
known in the art, and are discussed infra; for example, a 
method for the construction of pilC libraries in 

10 lambda-gtll is discussed infra in Section IV. A. However, 
DNA libraries which are useful for screening with nucleic 
acid probes may also be constructed in other vectors 
known in the art, for example, larabda-gtlO {Huynh et al . 
(1985)). Another suitable vector for the creation of 

15 libraries may be EMBL3 , which is a replacement vector 
which accepts inserts ranging from 9 to 23 kb in size. 
In general, methods for constructing DNA libraries is 
discussed in Maniatis et al, MOLECUIiAR CLONING, 2nd 
edition, (1989) . 

20 The sequence information derived from these 

overlapping pilC DNAs is useful for determining areas of 
homology and heterogeneity within the pllC gene(s), which 
could indicate the presence of different strains 
gonococci, meningococci, or other hitherto unrecognized 

25 pathogenic forms of Neisseria. It is also useful for the 
design of hybridization probes to detect PilC antigens or 
pile nucleic acids in biological samples. Moreover, the 
overlapping DNAs may be used to create . expression vectors 
for polypeptides derived from pilC gene(s) . 

3 0 The pile DNA sequence information may also 

allow the construction of additional bacteriostatic 
agents for treatment of neisserial infections, in that 
they may block the expression of PilC and/or pilin 
assembly. For example, it may be used to derive 
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antisense polynucleotides. Antisense polynucleotides 
molecules are comprised of a complementary nucleotide 
sequence which allows them to hybridize specifically to 
designated regions of genomes or RNAs. Antisense 
5 polynucleotides may include, for example, molecules that 
will block protein translation by binding to mRNA, or may 
be molecules which prevent replication of DNA by 
replicase. They may also include molecules which carry 
agents (non-covalently attached or covalently bound) 

10 which cause the mRNA or genomic DNA to be inactive by 
causing, for example, scissions in these molecules. 
Antisense molecules which are to hybridize to pllC 
derived polynucleotides may be designed based upon the 
sequence information of the pllC DNA secpjences provided 

15 herein, including those which would be isolated from 

additional DNA libraries. The antibacterial agents based 
upon anti-sense polynucleotides for pilC may be designed 
to bind with high specificity, to be of increased 
solubility, to be stable, and to have low toxicity. 

20 Hence, they may be delivered in specialized systems, for 
example, liposomes, or by gene therapy. In addition, 
they may include analogs, attached proteins, substituted 
or altered bonding between bases, etc. 

Other types of drugs may be based upon 

25 polynucleotides which "mimic" important control regions 
of the pile gene, and which may be therapeutic due to 
their interactions with key components of the system 
responsible for expression of the gene. 

In addition to the specific methods described 

3 0 in the Exanqples, general methods are known which may be 
used in the practice of the invention. For example, 
general techniques used in extracting the genome from 
bacteria, including Neisseria, preparing and probing a 
DNA library, sequencing clones, constructing expression 
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vectors, transforming cells, performing immunological 
assays such as radioimmunoassays and ELISA assays, for 
rowing cells in culture, and the like are known in the 
rt and laboratory manuals are available describing these 
5 techniques. However, as a general guide, the following 
sets forth some sources currently available for such 
procedures, and for materials useful in carrying them 
out . 

Both prokaryotic and eukaryotic host cells may 

10 be used for expression of desired coding sequences when 
appropriate control sequences which are compatible with 
the designated host are used. Among prokaryotic hosts, 
E. coli is most frequently used. Expression control 
sequences for prokazyotes include promoters, optionally 

15 containing operator portions, and ribosome binding sites. 
Transfer vectors compatible with prokaryotic hosts are 
commonly derived from, for example, pBR322, a plasmid 
containing operons conferring simpicillin and tetracycline 
resistance, and the various pUC vectors, which also 

20 contain sequences conferring antibiotic resistance mark- 
ers. These markers may be used to obtain successful 
transf ormants by selection. Commonly used prokaryotic 
control sequences include the Beta-lactcimase 
(penicillinase) and lactose promoter systems (Chang et 

25 al. (1977)), the tryptophan (trp) promoter system 
(Goeddel et al. (1980) ) and the lambda -derived 
promoter and N gene ribosome binding site (Shimatake et 
al. (1981)) and the hybrid tac promoter (De Boer et al. 
(1983)) derived from sequences of the trp and lac UV5 

3 0 promoters. The foregoing systems are particularly 

compatible with E. coli-, if desired, other prokaryotic 
hosts such as strains of Bacillus or Pseudomonas may be 
used, with corresponding control sequences. 
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Eukaryotic hosts include yeast and mammalian 
cells in culture systems. Saccharomyces cerevislae and 
5acci2ax-oinj^ces carlsberg-ensls are the most commonly used 
yeast hosts, and are convenient fungal hosts. Yeast 
5 compatible vectors carry markers which permit selection 
of successful transformants by conferring prototrophy to 
auxotrophic mutants or resistance to heavy metals on 
wild- type strains. Yeast compatible vectors may employ 
the 2 micron origin of replication {Broach et al. 

10 (19 83)), the combination of CEN3 and ARSl or other means 
for assuring replication, such as sequences which will 
result in incorporation of an appropriate fragment into 
the host cell genome. Control sequences for yeast 
vectors are known in the art and include promoters for 

15 the synthesis of glycolytic enzymes (Hess et al. (1968); 
Holland et al. (1978)), including the promoter for 3 
phosphoglycerate kinase (Hitzeman (1980)). Terminators 
may also be included, such as those derived from the 
enolase gene (Holland (1981) ) . Particularly useful 

20 control systems are those which con5)rise the 

glyceraldehyde-3 phosphate dehydrogenase (GAPDH) promoter 
or alcohol dehydrogenase (ADH) regulatable promoter, 
terminators also derived from GAPDH, and if secretion is 
desired, leader sequence from yeast alpha factor. In 

25 addition, the transcriptional regulatory region and the 
transcriptional initiation region which are operably 
linked may be such that they are not naturally associated 
in the wild- type organism. These systems are described 
in detail in EPO 120,551, published October 3, 1984; EPO 

30 116,201, published August 22, 1984; and EPO 164,556, 
published December 18, 19 85. 

Mammalian cell lines available as hosts for 
expression are known in the art and include many 
dlmmortalized cell lines available from the American Type 
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Culture Collection (ATCC) , including HeLa cells, Chinese 
hamster ovary (CHO) cells, baby hamster kidney (BHK) 
cells, and a number of other cell lines. Suitable 
promoters for mammalian cells are also known in the art 
5 and include viral promoters such as that from Simian 

Virus 40 (SV40) (Fiers (1978)), Rous sarcoma virus (RSV) , 
adenovirus (ADV) , and bovine papilloma virus (BPV) . 
Mammalian cells may also require terminator sequences and 
poly A addition sequences; enhancer sequences which 

10 increase expression may also be included, and sequences 
which cause amplification of the gene may also be 
desirable. These sequences are known in the art. 

Vectors suitable for replication in mammalian 
cells are known in the art, and may include viral 

15 replicons, or sequences which insure integration of the 
appropriate sequences encoding PilC epitopes into the 
host genome. 

A vector which is used to express foreign DNA, 
and which may be used in vaccine preparation is Vaccinia 

20 virus. In this case the heterologous DNA is inserted 

into the Vaccinia genome. Techniques for the insertion 
of foreign DNA into the vaccinia virus genome are known 
in the art, and utilize, for example, homologous 
recombination. The insertion of the heterologous DNA is 

25 generally into a gene which is non-essential in nature, 
for example, the thymidine kinase gene (tk) , which also 
provides a selectable marker. Plasmid vectors that 
greatly facilitate the construction of recombinant 
viruses have been described (see, for example, Mackett et 

30 al. (1984), Chakrabarti et al. (1985); Moss (1987)), 
Expression of the polypeptide containing at least one 
immunoreactive PilC epitope then occurs in cells or 
individuals which are immunized with the live recombinant 
vaccinia virus • 
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Other systems for expression of desired 
polypeptides include insect cells and vectors suitable 
for use in these cells. These systems are known in the 
art, and include, for example, insect expression transfer 
5 vectors derived from the baculovirus Auizo^rapha 

callfomica nuclear polyhedrosis virus (AcNPV) , which is 
a helper- independent, viral expression vector. 
Expression vectors derived from this system usually use 
the strong viral polyhedrin gene promoter to drive 

10 expression of heterologous genes. Currently the most 
commonly used transfer vector for introducing foreign 
genes into AcNPV is pAc373. Many other vectors, known to 
those of skill in the art, have also been designed for 
improved expression. These include, for example, pVL9 85 

15 (which alters the polyhedrin start codon from ATG to ATT, 
and which introduces a BamHI cloning site 32 basepairs 
downstream from the ATT; See Luckow and Sximmers (1989)) . 

Methods for the introduction of heterologous 
DNA into the desired site in the baculovirus virus are 

20 known in the art. (See Summer and Smith, Texas 

Agricultural Experiment Station Bulletin No. 1555; Ju et 
al. (1987); Smith et al- (1983); and Luckow and Summers 
(1989)) . For example, the insertion can be into a gene 
such as the polyhedrin gene, by homologous recombination; 

25 insertion can also be into a restriction enzyme site 
engineered into the desired baculovirus gene. The 
inserted sequences may be those which encode all or 
varying segments of the polyprotein, or other orfs which 
encode viral polypeptides. . For example, the insert could 

30 encode the following numbers of amino acid segments from 
the polyprotein: amino acids 1-1078; amino acids 332-662; 
ainino acids 406-662; ainino acids 156-328, and amino acids 
199-328. 
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The signals for posttranslational 

modifications, such as signal peptide cleavage, 

proteolytic cleavage, and phosphorylation, appear to be 

recognized by insect cells. The signals required for 

5 secretion and nuclear accumulation also appear to be 

conserved between the invertebrate cells and vertebrate 

cells. Examples of the signal sequences from vertebrate 

cells which are effective in invertebrate cells are known 

in the art, for example, the human interleukin 2 signal 

10 (IL2 ) which is a signal for transport out of the cell, 
s 

is recognized and properly removed in insect cells. 

Recombinant polynucleotides are inserted into 
host cells by transformation. Transformation may be by 
any known method for introducing polynucleotides into a 

15 host cell, including, for example packaging the 

polynucleotide in a virus and transducing a host cell 
with the virus, and by direct uptake of the 
polynucleotide. The transformation procedure used 
depends upon the host to be transformed. Bacterial 

20 transformation by direct uptake generally employs 
treatment with calcium or rubidium chloride (Cohen 
(1972); Maniatis (1989)). Yeast transformation by direct 
uptake may be carried out using the method of Hinnen et 
al. (1978) . Mammalian transformations by direct uptake 

25 may be conducted using the calcium phosphate 

precipitation method of Graham and Van der Eb (1978) , or 
the various known modifications thereof. Other methods 
for the introduction of recombinant polynucleotides into 
cells, particularly into mammalian cells, which are known 

30 in the art include dextran-mediated transf ection, calcium 
phosphate mediated transf ection, polybrene mediated 
transf ection, protoplast fusion, elect roporat ion, 
encapsulation of the polynucleotide (s) in liposomes, and 
direct microinjection of the polynucleotides into nuclei. 
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The recombinant polynucleotide may be in the 
form of a vector. Vector construction employs techniques 
which are known in the art. Site-specific DNA cleavage 
is performed by treating with suitable restriction 
5 enzymes under conditions which generally are specified by 
the manufacturer of these commercially available enzymes . 
In general, about 1 microgram of plasmid or DNA sequence 
is cleaved by 1 \init of enzyme in about 20 microliters 
buffer solution by incubation of 1-2 hr at 37*^ C. After 

10 incubation with the restriction enzyme, protein is 
removed by phenol/chloroform extraction and the DNA 
recovered by precipitation with ethanol. The cleaved 
fragments may be separated using polyacrylamide or 
agarose gel electrophoresis techniques, according to the 

15 general procedures found in Methods in Enzymology (19 80) 
65.:499-560. 

Sticky ended cleavage fragments may be blunt 
ended using E. coll DNA polymerase I (Klenow) in the 
presence of the appropriate deoxynucleotide triphosphates 

20 (dNTPs) present in the mixture. Treatment with SI 

nuclease may also be used, resulting in the hydrolysis of 
any single stranded DNA portions . 

Ligations are carried out using standard buffer 
and temperature conditions using T4 DNA ligase and ATP; 

25 sticky end ligations rec[uire less ATP and less ligase 

than blunt end ligations. When vector fragments are used 
as part of a ligation mixture, the vector fragment is 
often treated with bacterial alkaline phosphatase (BAP) 
or calf intestinal alkaline phosphatase to remove the 

30 5 '-phosphate and thus prevent religation of the vector; 
alternatively, restriction enzyme digestion of unwanted 
fragments can be used to prevent ligation. 

Ligation mixtures are transformed into suitable 
cloning hosts, such as E. coll, and successful 
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transformants selected by, for example, antibiotic 
resistance, and screened for the correct construction. 

The desired recombinant DNA sequences may be 
synthesized by synthetic methods. Synthetic 
5 oligonucleotides may be prepared using an automated 
oligonucleotide synthesizer as described by Warner 
(1984) . If desired the synthetic strands may be labeled 
with ^^P by treatment with polynucleotide kinase in the 
presence of "^^P-ATP, using standard conditions for the 
10 reaction, 

DNA sequences, including those isolated from 
DNA libraries, may be modified by known techniques, 
including, for example site directed mutagenesis, as 
described by Zoller (1982) . Briefly, the DNA to be 

15 modified is packaged into phage as a single stranded 

sequence, and converted to a double stranded DNA with DNA 
polymerase using, as a primer, a synthetic 
oligonucleotide complementary to the portion of the DNA 
to be modified, and having the desired modification 

20 included in its own sequence. The resulting double 

stranded DNA is transformed into a phage supporting host 
bacterium. Cultures of the transformed bacteria, which 
contain replications of each strand of the phage, are 
plated in agar to obtain plaques. Theoretically, 50% of 

25 the new plaques contain phage having the mutated 
sequence, and the remaining 50% have the original 
sequence. Replicates of the plaques are hybridized to 
labeled synthetic probe at temperatures and conditions 
which permit hybridization with the correct strand, but 

30 not with the unmodified sequence. The sequences which 
have been identified by hybridization are recovered and 
cloned. 

DNA libraries may be probed using the procedure 
of Grunstein and Hogness (1975) . Briefly, in this 
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procedure, the DNA to be probed is immobilized on nitro- 
cellulose filters, denatured, and prehybridized with a 
buffer containing 0-50% formamide, 0.75 M NaCl, 75 mM Na 
citrate, 0.02% (wt/v) each of bovine serum albumin, poly- 
5 vinyl pyrollidone, and Ficoll, 50 mM Na Phosphate (pH 
6.5), 0.1% SDS, and 100 micrograms /na carrier denatured 
DNA. The percentage of formamide in the buffer, as well 
as the time and temperature conditions of the 
prehybridization and siibsequent hybridization steps 

10 depends on the stringency required. Oligomeric probes 
which require lower stringency conditions are generally 
used with low percentages of formamide, lower 
temperatures, and longer hybridization times. Probes 
containing more than 30 or 40 nucleotides such as those 

15 derived from cDNA or genomic sequences generally employ 
higher temperatures, e.g., about 40-42 C, and a high 
percentage, e.g., 50%, formamide. Following 
prehybridization, 5' -^^P-labeled oligonucleotide probe is 
added to the buffer, and the filters are incubated in 

20 this mixture under hybridization conditions. After 
washing, the treated filters are subjected to 
autoradiography to show the location of the hybridized 
probe; DNA in corresponding locations on the original 
agar plates is used as the source of the desired DNA. 

25 For routine vector constructions, ligation 

mixtures are transformed into B. coli strain HBlOl or 
other suitable host, and successful transf ormants 
selected by antibiotic resistance or other markers. 
Plasmids from the transf ormants are then prepared 

30 according to the method of Clewell et al. (1969), usually 
following chloramphenicol amplification (Clewell (1972) ) . 
The DNA is isolated and analyzed, usually by restriction 
enzyme analysis and/or sequencing. Sequencing may be by 
the dideoxy method of Sanger et al. (1977) as further 
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described by Messing et al . (1981), or by the method of 
Maxam et al. (19 80) . Problems with band compression, 
which are sometimes observed in GC rich regions, were 
overcome by use of T-deazoguanosine according to Barr et 
5 al, (1986). 

An enzyme- linked immunosorbent assay (ELISA) 
can be used to measure either antigen or antibody 
concentrations. This method depends upon conjugation of 
an enzyme to either an antigen or an antibody, and uses 

10 the bound enzyme activity as a quantitative label. To 
measure antibody, the known antigen is fixed to a solid 
phase (e.g., a microplate or plastic cup), incubated with 
test serum dilutions, washed, incubated with 
ant i- immunoglobulin labeled with an enzyme, and washed 

15 again. Enzymes suitable for labeling are known in the 
art, and include, for example, horseradish peroxidase. 
Enzyme activity bound to the solid phase is measured by 
adding the specific substrate, and determining product 
formation or substrate utilization colorimetrically , The 

20 enzyme activity bound is a direct function of the amount 
of antibody bound. 

To measure antigen, a known specific antibody 
is fixed to the solid phase, the test material containing 
antigen is added, after an incubation the solid phase is 

25 washed, and a second enzyme -labeled antibody is added . 

After washing, substrate is added, and enzyme activity is 
estimated colorimetrically, and related to antigen 
concentration . 

Examples 

30 Described below are examples of the present 

invention which are provided only for illustrative 
purposes, and not to limit the scope of the present 
invention. In light of the present disclosure, numerous 
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embodiinents within the scope of the claims will be appar- 
ent to those of ordinary skill in the art- 

5 Isolation of PilC 

An outer membrane preparation from 
N. gonorrhoeae strain MSllj^^CP"*") contains small amounts 
of a 110 kd protein, PilC. This protein was enriched 
during alternate cycles of crystallization and 

10 solubilization of pili, unlike other outer membrane 

proteins that decreased in abundance by this procedure. 

The materials and methods used for the 
isolation procedure were the following. 
Bacterial strains and growth condi tions 

15 N. gonorrhoesLe MSll^ (Meyer et al., 1984) and 

P" and P'n variants of MSllj^j^ (Swanson et al., 1986) were 
kindly obtained from Dr. M. So and from Dr. M. Koomey, 
respectively. The gonococcal isolates UMOl and KH4318 
have previously been described (Norlander et al., 1981). 

20 gonorrhoeae strains 605344 and 605103 were obtained 

from Dr. D. Danielsson, Orebro, Sweden, and strain 765 
was isolated at the Department of Bacteriology in Umea, 
Sweden. The commensal Neisseria species N. lactamica 
Nctc 10618 and N. sub f lava GNOl were obtained from 

25 Pharmacia, Uppsala, Sweden. These bacteria were grown at 
37 °C in a 5% CO 2 atmosphere on Difco GCB agar containing 
Kellogg' s supplement. Piliated (P"*") and nonpiliated (P*) 
variants were distinguished by colony morphology and 
passed as single colonies. E. coli strain Y 1090 

3 0 (obtained from Promega Biotech) was used for plaque 

screening, DH5 (Hanahan, 1985) for molecular cloning, 
AAIO recA (Stoker et al. 1984) for isolation of minicells 
and TGI (Gill et al., 1986) for propagation of M13 
clones . 
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Preparation of pili and outer membranes 

Pili were prepared essentially as described by 
Brinton et al- (1978) . Gonococci (P'*"Tr) from 80 GGB 
5 plates, grown for 18 h, were harvested in 0,05 M Tris- 

HCl pH 8.0 and 0 . 15 M NaCl , washed twice and resuspended 
in 40 ml 0.15 M ethanolamine pH 10.5. Pili were sheared 
off in a Sorvall Omnimixer, setting 3 for 30 s. The cell 
debris was pelleted at 13,000 g for 30 min at 4°C and the 

10 supernatant was dialyzed against 0,05 M Tris-HCl pH 8 . 0 
and 0.15 M NaCl. The crystallized pili were pelleted at 
13,000 g for 60 min., resuspended in 0.15 M ethanolamine 
pH 10.5, and centrifuged at 23,000 g for 60 min. The 
supernatant was dialyzed as described above against 0 . 05 

15 M Tris-HCl pH 8.0 and 0 . 15 M NaCl. Several cycles of 
crystallization and solubilization were performed to 
produce pili preparations with high purity- Outer 
membranes of N. gonorrhoeae were prepared by the sarkosyl 
method described by Norquist et al* (1978) . 

20 

Example 2 

Preparation of Purified Anti-PilC Ant ibodies 

The 110 kd protein present in purified MSll^^^g 
pili preparations was eluted from SDS polyacrylamide gels 
25 and rabbit antibodies were generated against the' gel 
purified protein* The antiserum cross reacted 
extensively with the pilin protein in immunoblots and was 
therefore absorbed with extracts of Pseudomonas putida 
expressing the pilin subunit of N. gonorrhoeae on plasmid 
30 pGC02. 

Pili preparations of gonorrhoeae MSll^^CP"^) 
crystallized 5 times were separated on 10% SDS -poly- 
acrylamide gels using the buffer system of Laemmli 
(1970) . These gels were stained in 0.25 M KCl and 1 mM 
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DTT for 5 min, , the 110 kd protein band was sliced out, 
crushed and incuJDated in a buffer containing 0.05 M Tris- 
HCl pH 7.9, 0.1 inM EDTA, 5 mM DTT and 0.15 M NaCl at 4°C 
overnight. Gel pieces were removed by centrifugation 
5 prior to immunization of rabbits . 

The achieved 110 kd- antiserum was extensively 
absorbed with Pseudomonas putida, 2440 (Bagdasarien et al . 
(1983), carrying a recombinant plasmid, pGC02, 
constructed as follows- The 1.0 kb Hpal-ScoRI fragment 

10 of the pilus gene clone pNGllOO (Meyer et al. 1984) 

obtained from M. So was cloned into the Hpal and EcoRX 
sites of pMMB66 (Furste et al., 1986). The pilE gene is 
then under control of the tac promoter and induction with 
1 mM IPTO resulted in high levels of pil in ^produced in P. 

15 putxda, 2440, but no extracellular pili structures were 
obseirved. Dense sonicated cultures of P. putida 
2440/pGC02 were mixed in a 1:1 ratio with the crude 
antiserum. About 15 cycles of 1 h incubation and 30 min 
centrifugation at 25,000 g in the presence of 1 mM PMSF 

20 (phenylmethylsulfonylfluoride) at 4<'C were performed. 

The pili antiserum used in immunoblots was 
generated in a rabbit against highly purified pili 
preparations of J7. gronorriioeae MSll^^j^^. 

In immunoblots 10 fig of boiled bacterial cells 

25 or the same amount of outer membranes were 

electrophoresed on 10% SDS-polyacrylamide gels. The 
proteins were transferred from the gel onto 
nitrocellulose sheets where their immunological cross - 
reaction with the 110 kd absorbed antiserum was tested 

30 using an immunoblotting protocol as described by Towbin 
et al. (1979) . 
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Example 3 

Specificity of Purified Anti-PilC Antibodies 

The absorbed antiserum was used in immunoblots 
with whole cell extracts of a number of N. gonorrhoeae 
5 strains as well as commensal strains of Neisseria (Figure 
2A) . All strains of N, gonorrhoeae, except strain 
605103, contained one or two high molecular weight 
protein species reacting with the antiserum. Strain 
605103, unlike the other strains tested, was nonpiliated 

10 and no piliated variants could be obtained suggesting 
that it is a P"n variant (Swanson et al., 1985)- This 
was confirmed by Southern blot hybridization using an 
oligonucleotide probe corresponding to the 5' end of the 
pllE gene. No hybridization was obtained with this 

15 probe. The commensal lactamica Nctcl0618, but not N. 
subflava GNGl, contained a high molecular weight protein 
reacting with the 110 kd antiserum. Immunoblots against 
outer membrane preparations of P"*" and P"n MSllj^j^ showed 
the 110 kd protein to be present in the outer membrane in 

20 both of these MSll variants. 

Southern blot hybridization was accomplished as 
follows. Digested genomic DNA was separated on 0.7% 
agarose gels and transferred to nitrocellulose filters 
(Southern, 1975) . After transfer and baking the filters 

25 were prehybridized in a mixture of 5 x SSC, 0.1% SDS, 
5 mM EDTA, 5 x Denhardt's solution and 100 fig/ml of 
sonicated calf thymus DNA at 65°C for 2-6 h. -^^P-labeled 
probe (multiprime DNA labelling system, Amersham 
International) was added and hybridization was performed 

30 for 12-15 h at the same temperature. The filters were 
washed in 2 x SSC with 0.1% SDS and in 0.2 x SSC with 
0.1% SDS for 2 X 15 min each, dried and exposed to Kodak 
XRP film at -80°C. 
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A 21-base-long oligonucleotide complementary to 
the signal peptide coding region of p±lE (5'- 
GCCTTTTTGAAGGGTATTCAT-3 ' ) was -^^P-labeled with T4 
polynucleotide kinase and used to probe Clal- digested 
5 genomic DNA. The blot was prehybridized at 37^C in a 

mixture containing 2 x Denhardt's, 0.1% SDS, 2^5 mM EDTA, 
5 X SSC and 100 fig /ml sonicated calf thymus DNA, 
hybridized at 37 °C and washed in 2 x SSC for 5 min. 
MSlljjjj^CP**") gave a 4 kb hybridization fragment, whereas 
10 MSllj^{P"n) and 605103 gave no hybridization signal. 

Example 4 
Molecular clonincr of the p llCl aene 
encoding a 110 kd protein 

15 Chromosomal DNA from N, g-onorrhoeae MSllj^(P'^) 

was used to construct a Xgtll library* The library was 
screened with the absorbed 110 kd antisertim and one 
positive clone out of 10,000 plaques was found, 
containing an 800 bp insert. A lysogen of this positive 

20 Xgtll clone was examined in immunoblots and a fusion 

protein with an estimated size of 150 kd reacted with the 
antiserum (data not shown) • The 800 bp insert was 
purified, labeled with ^^P, and used as a probe to screen 
a plasmid library from N. gonorrhoeae MSllj^^. Six clones 

25 out of 10,000 hybridized with the probe. Restriction 

maps for these partially overlapping six clones are shown 
in Figure 1. 

In Figure 1, plasmids pABJ04-09, which all 
belong to locus 1, were isolated from a plasmid library 

30 using the 800 bp insert from Xgtll as a probe. The Xgtll 
insert (from locus 2) has an additional SalT site not 
found in the plasmid clones. The position of the pilCl 
gene and direction of its transcription (indicated . by an 
arrow) were determined in E. coll minicells. Three 
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thickened lines, with an arrow-head at each end, indicate 
fragments used as probes in Southern hybridizations . 
I.e., the 800 bp insert from Xgtll, the EcoRV^'EcoRV2 
(l.Skb) and the BcoRVg -Hindlll^ (0-8 kb) fragments of 
5 pAGJ04. Triangles mark the location of two mTnCm 
insertions in pABJ04. The resulting plasmids, 
pABJ)4: :mTnCm-12 and pABJ04 : imTnCm- 14 were used, were 
used to inactivate pilCl and pilC2. 

The six plasmid clones, pABJ04-09, were 

10 transformed into the minicell producing strain AAIO to 
monitor expression of plasmid encoded [ S] methionine 
labeled proteins. The E. coli minicell strain AAIO was 
transformed with plasmid DNA (pABJ04-09) and chromosome 
deficient minicells from these strains were purified over 

15 sucrose gradients (Thompson and Achtman, 1978) • The 

plasmid- encoded proteins were labeled in the presence of 
80 fiC± [-^^S] methionine in minimal salts medium and 1% 
methionine assay mediiim (Dif co) . After lysis of the 
minicells in sample buffer (Laemmli, 1970) the proteins 

20 were electrophoresed on an SDS-polyacrylamide gel, the 

gel was dried and exposed to X-ray film (Kodak X-OmatAR) . 

Plasmid pABiJ04 expressed minute amounts of 
three high molecular weight proteins, 113, ill and 108 kd 
in size, as well as a number of lower molecular weight 

25 protein species not produced from the vector control. 
The three high molecular weight bands were missing in 
pABJOS and pABJOe but three novel lower molecular weight 
protein species had appeared, suggesting that pABJOS and 
pABJ06 are deleted for the 3/ end of a gene, denoted 

30 pilCl, and that this gene is responsible for all three 
high molecular weight species. This suggested that the 
distal end of the gene must be located between the Mlul^ 
and MUII2 sites (Figure 1) . The observation that plasmid 
pABJ07 did not express any high molecular proteins 

35 
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tentatively located the 5' end of the gene to a region 
0.5-1.2 kb to the right of the EcoRV^ site. The size for 
a gene encoding a 110 kd protein is -3 kb which is in 
agreement with these mapping data. 

5 

Identification of a Second Gene Encoding PilC 

The 800 bp insert in Xgtll contains a single 
Sail site not present in the region on pABJ04 which 

10 hybridized to this fragment, suggesting that there is 

more than one pilC locus in the genome of N. gonorrhoeae 
MSll. This was confirmed in Southern blot hybridizations 
in which three different pxlC fragments were used to 
probe Snzal and Clal digested genomic DNA. The 800 bp 

15 fragment from Xgtll hybridized in a Southern blot to two 
Clal (18 and 8 kb) and Smal (13 and 4.5 kb) fragments of 
DNA prepared from gonorrhoeae MSllj^^j^. Since the probe 
does not contain any internal Clal or anal sites, there 
are presumably two copies of the 3' end of pllC in the 

20 MSll genome. The 1.3 kb EcoRV^ - EC0RV2 fragment of 

pABiJ04 carries the central region of pilCl. This probe 
hybridized to the same two Clal fragments and to four 
Smal fragments, two of which are the same size as the two 
Smal fragments identified with the 800 bp probe (13 kb 

25 and 4.5 kb) . Hybridization with the 800 bp probe was 
more extensive to the 8 kb Clal and the 4 kb Smal 
fragment whereas the reverse was found with the 1.3 kb 
-ScoRV^ - SC0RV2 fragment from pABJ04 strongly suggesting 
that the two genomic copies of pllC show a significant 

30 sequence variation in the 3' as well as in the central 

region. A probe corresponding to the 5' region of pllCl 
was also used in Southern hybridization experiments . 
This 0.8 kb Hindlll^ - EcoRV^ fragment hybridized to two 
Clal (18 kb and 4 kb) and Smal (25 kb and 7 kb) fragments 

35 
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with seemingly equal efficiency. The hybridization 
pattern was identical using DNA from MSIIj^g. Taken 
together these hybridization data indicate that 
N. gonorrhoeae MSll contains two complete copies of pilC. 
5 Furthermore the two genes appear to be more homologous in 
their 5' as compared to their central and 3' regions. 

The results indicate that the 800 bp insert 
from Xgtll carries information from pilC2 whereas the 
clones pABJ04-09 must carry information from pilCl. 

10 Finally pilC2 must be located >2 kb from either end of 
pilCl. The DNA sequence of the 3 '-end of the p±lC2 
fragment is shown in Figure 7. The sequence showing the 
putative amino acids encoded therein are shown in Figure 
8, A comparison of the analogous portions of pilC2 (top) 

15 and pilCl (bottom) DNA sequences, and the putative amino 
acids encoded therein are shown in Figure 9- 

The 800 bp fragment from p±lC2 was also used to 
probe digested genomic DNA from N. gonorrhoeae strains 
UMOl, 765 and 605103. The latter isolate does not 

20 express detectable levels of the 110 kd protein. Strain 
UMOl, unlike MSll, contained only one Clal fragment of 15 
kb that hybridized to the probe (data not shown) . Hence, 
this strain may contain only one copy of pilC. Strain 
605103 and 765, on the other hand, each seem to contain 

25 two copies of pilC since two Clal and two Smal fragments 
hybridized to the 800 bp probe. 

The commensal N. lactamica Nctcl0618 DNA 
digested with Clal and Smal also hybridized with the 800 
bp probe. Since only one band hybridized in each case 

30 this strain may contain only one copy of pilC. In 

contrast, N. subflava GNOl did not hybridize to the 800 
bp pilC2 probe using the same stringency. 
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Example 6 
Characterization of the r>±lC Genes 

The pilCl gene on pABJ04 is translatio nally out of frame 
5 The amino terminal sequence of gel purified 110 

kd protein from strain MSll^^^g (P"^) was determined by 
sequential Edman degradation. For aminoterminal sequence 
determination automated Edman degradations (Edman and 
Bregg, 1967) were performed in an updated Beckman 89 0C 

10 spinning cup sequencing sequencer. The sequencing 

procedure and the method for analysis of the 3 -phenyl -2- 
thiohydantoin derivatives been described (Engstrom et 
al., 1984). Considerable difficulties were encountered 
in the method probably due to blocking of the N- terminus. 

15 As a result, only the residues from position 4 to 10 were 
obtained (Figure 2) . 

The 3.3 kb Hindlll^ - Mlul^ fragment 
encompassing the entire pllCl gene was sequenced on both 
strands using the dideoxy sequencing method adapted for 

20 single stranded DNA. 

Purif ied DNA fragments from pABJ04 and PCR- 
air5)lified 5' end of pllCl was subcloned into M13 vectors 
(Sanger et al., 1980: Yanish-Perroa et al., 1985) and 
sequenced using the chain termination method of Sanger et 

25 al* (1977) . Primers used were the M13 17-mer universal 
primer and oligonucleotides synthesized at Symbicon, 
Umea, Sweden or at the Department of Biochemistry, 
Washington University, St., Louis, MO, USA. 

The results of the sequencing showed that the 

30 pilCl contained one single open reading frame of 997 

codons (from left to right in Figure 2) and starting at 
an AUG codon 195 bp from the HindXII^ site. Codons 7-12 
in this open reading frame corresponded to amino acids 5- 
10 in the sequence of the gel purified protein. The AUG 
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codon in the beginning of the long open reading frame was 
not preceded by a typical Shine -Dalgarno sequence. 
Moreover, since the 110 kd protein is located in the 
outer membrane of iff. gronorriioeae, we expected the protein 
5 to be translated with a signal sequence. When examining 
the nucleotide sequence, an AUG codon was found in frame 
1 that was preceded by a typical Shine -Dalgarno sequence 
(-AGGAA-). The sequence following this AUG codon would 
encode a typical signal peptide with basic amino acids in 

10 the amino terminal region and a hydrophobic central 

region. However, no signal peptidase cleavage site could 
be predicted following the rules of von Heijne (19 83) • A 
tract of 12G residues was found in the region encoding 
the putative signal peptide for PilC* Addition of one G 

15 residue or the loss of two would, align the long open 
reading frame with the AUG codon in frame 1. The 
translated region in frame 2 contains a putative signal 
peptidase cleavage site between Ala and Gin. A cleavage 
at this site would align the determined amino acid 

20 sequence at positions 5-10 for the 110 kd protein with 
the deduced amino acid sequence. The data therefore 
suggested that the cloned pilCl gene is out of frame due 
to f rameshif ting in the region encoding the signal 
peptide. 

25 Figure 2 shows the nucleotide secjuence and the 

deduced amino acid sequence of the 5' -end of pilC. The 
amino- terminal sequence of gel purified PilC from 
MSllj^CP"*") is shown in a box below frame 2, a 997 amino 
acid long open reading frame that would code for a 

30 protein about 110 kd in size. Frame 1 contains 41 amino 
acids and is preceded by a putative Shine -Dalgarno 
sequence (underlined) . Two horizontal lines mark a 
stretch of 12 G residues. An addition of one G in this 
region would align the ATG (boxed) in frame 1 with frame 

35 
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2- Numbers above the sequence show base positions 
relative to the HindlXX^ site (=0) located on pABJ04. 
The position of two 24 bp oligonucleotide primers 
(opposite stands) used for PGR amplification, are 
5 indicated above the sequence by hatched bars. 

Figure 3 shows the nucleotide sequence of the 
sense strand of the pilCl gene* 

Figure 4 shows the shows the nucleotide 
sequence of the sense strand of the pllCl gene and the 
10 amino acids encoded therein. 

Genetic inactivation of pilC2 but n ot pllCl 
abolishes expression of the 110 kd prote in in MSll 

Plasmid pABJ04 was mutagenized in E. coll by a 

15 transposon mini-Tn3 derivative, mTnCm. The shuttle 

mutagenesis system developed by Seifert et al., (1986) 
using a miniTn3 carrying the chloramphenicol resistance 
gene was kindly provided by Dr, M. So. Mutagenesis of 
pABiJ04 with mTnCm and transformation of N. g-onorrhoea.e 

20 were performed as previously described (Seifert et al., 
1990) . MiniTnCm insertions at 30 different positions in 
pAB*J04 were identified, two of which mapped within the 
Pile gene. Filiated N. gonorrhoeaB MSll^^^j^ were 
transformed with 2 /zg plasmid DNA, transf ormants were 

25 selected for on plates containing 10 ^xg/ml 

chloramphenicol for the single mutants and 3 0 /xg/ml 
chloramphenicol for the double mutants . 

Only two mTnCm insertions had occurred in 
pllCl (Figure 1) . Truncated protein species were seen in 

30 minicells with the mTnCm-14 insertion located 0.5 kb from 
the 3' end of pilCl but not with the mTnCm-12 insertion 
located 0,5 kb from the 5' end of the gene. Both 
insertion mutants were used in a gene replacement 
experiment- Plasmids pABJ04 : :mTnCm-12 and pABJ04 : :mTnCm- 
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14 were linearized with BamHI and transformed into 
N. gonorrhoeae MSllj^{P'^) and transf ormants resistant to 
10 /xg/ml of chloramphenicol were selected. Forty- eight 
P"^ transf ormants (24 from each experiment) were assayed 
5 for the presence of PilC in immunoblots. All these 

transf ormants remained capable of expressing the PilC 
protein. Genomic DNA was prepared from seven of the 
chloramphenicol resistant transf ormants (five from 
pABJ04: :mTnCm-12 and two f rom pABJ04 : :mTnCm-14) , cleaved 

10 with Clal and PvuII and used in Southern blot experiments 
using the EcoRV^ - EC0RV2 fragment of pABJ04 as a probe. 
The 8 Icb Clal fragment was unaffected in the mutants 
whereas the 18 kb Clal fragment had been replaced by a 20 
kb fragment . PvuII cleaves within the 1.6 kb mTnCm 

15 element. The probe detected an 8 kb PvuII fragment in 
both parent and mutant DNA. In the mutants, a novel 
PvuII fragment appeared that was 6.2 kb in size in five 
transf ormants obtained with pABJ04 : rmTnCm- 12 and 4.8 kb 
in size in two transf ormants with pABJ04 : :mTnCm-14 . To 

20 confirm the insertion of mTnCm, a 250 bp EcoRI-Hindlll 
fragment of the CAT GenBlock (Pharmacia, Sweden) , 
containing the PvuII site, was used as a probe. It 
detected the larger of the two Clal fragments as well as 
the 6.2 kb PvuII fragment. In addition, a 2 kb PvuII 

25 fragment not covered with the pilC probe was detected . 
These data demonstrate that we have obtained gene 
replacements in pilCl, whereas p±lC2 was unaffected in 
all seven P"*", PilC"*" transf ormants . A rapid hybridization 
was done to screen the remaining 41 P"** transf ormants . 

3 0 All but one had mTnCm inserted in pilCl. The remaining 
trans formant had an intact locus 1 and 2 and must 
therefore contain mTnCm elsewhere in the gonococcal 
chromosome. 
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In the same transformation experiments, the 
frequency of P" colony variants was about five- fold 
higher as compared with that occurring normally in strain 
MSllj^ (?■*■) , Two P'mTnCm-12 transf ormants isolated at 10 
5 fig/rtiL of chlorairphenicol were also analyzed by Southern 
blot hybridization using the EcoRV^ - EC0RV2 fragment of 
pABJ04 and the EcoRI - Hindlll fragment of the CAT 
GenBlock. Each of these mutants carried mTnCm in p±lC2 
as evidenced by a replacement of the 8 kb Clal fragment 

10 by a fragment 9.5 kb in size that hybridizes to both 
probes* These pil C2 : :mTnCm insertion mutants did not 
express PilC as determined by immunoblot analysis. 

A P"*", pilCl: :mTnCm- 12 mutant was retransf ormed 
with DNA prepared from a P", pilC2: :mTnCm- 12 mutant and 

15 colonies growing at 3 0 /zg/ml of chloramphenicol were 

selected to obtain double mutants in pllC. All resistant 
transf ormants were P" , and when analyzed by Southern blot 
hybridization all contained mTnCm in both pllCl and 
pilC2. Electron microscopy revealed that the P"*", 

20 pilCl: :mTnCm- 12 mutant still expressed pili albeit at a 
slightly lower level than the MSll^^^j^ (P"*") parental clone, 
whereas the P~, pilC2: :mTnCm- 12 was completely bald as 
was the pilCl, p±lC2 double mutant. 

Immunoblot analyses were performed on the P"*" 

25 pilCl: :mTnCm- 12 mutant, the P", pilC2: :mTnCm- 12 mutant 
and the P", pilCI : :mTnCm- 12 , pilC2: imTnCm- 12 double 
mutant, using PilC and pili antisera. Inactivation of 
pilCl did not abolish expression of PilC or the pilin. 
Inactivation of pilC2 totally abolished e3<pression of 

30 PilC but did not affect expression of pilin. The pilCl, 
p±lC2 double mutant was PilC" but produced only low 
levels of pilin. Taken together these data imply that 
p±lC2 but not pllCl is expressing PilC in the MSll 
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variant under study. Moreover, inactivation of pilC2 but 
not pilCl was associated with a loss of piliation. 

P"*" revertants occurred spontaneously at a low 
frequency in the piIC2: zmTnCm- 12 mutants. These 
5 revertants expressed pili as determined by electron 

microscopy and also expressed PilC. It is likely that 
Pile expression is due to in- frame switching in pilCl. 

Figure 5 shows the nucleotide sequence of the 
sense strand of the pllCl gene, and the effect of frame 
10 shift on the putative gene products encoded therein. 

The pile genes of N. gonorrhoeae vary 

in the length of the G tract 
Polymerase chain reaction (PGR) with Tag 
15 polymerase was used to analyze the 5' region of pilC 

using two 24 base long synthetic oligonucleotides based 
on the sequence of pilCl (Figure 2) . These 
oligonucleotides would generate an amplified fragment of 
149 bases as judged from the sequence obtained from 
20 pABJ04. 

Polymerase chain reaction was carried out in 
100 ^1 containing 50 ng of genomic DNA or 5 ng of plasmid 
DNA- 1,0 fjLM of each oligonucleotide, 200 /xM of each 
nucleotide, 0.001% gelatin, 1.5 mM MgCl2/ 10 mM Tris (pH 

25 8.3), 50 mM KCl, 0.25 fil 1 mCi/ml ["^^PldATP and 2 U of 
Taq Polymerase (Perkin Elmer Cetus) . The samples were 
passed through 25 cycles: 2 min at 50 ^C/ 1 min at 94 °C 
and 3 min at 72 in a Thermal Cycler (Perkin Elmer 
Cetus) . Aliquots of the DNA fragments were denatured at 

3 0 95 °C for 2 min and electrophoresed on standard denaturing 
sequencing gels. 

The amplified products from MS11^(P"*") DNA were 
149 and 150 long respectively. In addition, two less 
abundant products of 151 and 148 bases were seen. The 
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amplif ied products were electroeluted and cloned into 
MlBmpia, and twenty phage clones were sequenced using a 
universal primer. Four different sequences were obtained 
(Figure 6) . 

5 Figure 6 shows the nucleotide sequence of PGR 

amplified fragments demonstrating a variation in the 
length of the G tract and sequence differences in the 5 ' - 
region of the pilC genes. The two oligonucleotide 
primers used for the PGR are shown in Figure 2 . 

10 Amplified DNA was cloned into M13mp8 and sequenced . 

Shown are the complete nucleotide sequence in between the 
two primers. In- frame secjuences are translated and the G 
stretches are underlined. The putative cleavage sites 
are marked with arrows. Genomic DNA from N. gonorrheas 

15 strains MSlKP'^.PilG"*') , UMOl (P'^.PilC"*') , 765 (P"*" . PilG"**) 
and 605103 (P'n.PilG"), and purified DNA from 
PABJ04/AA10 (recA) was used in the PGR. 

Variant patterns la and lb were identical to 
each other and to the cloned sequence on pABJ04 except 

20 for the presence of 11 instead of 12 G residues in the G 
tract of lb. The G tract of sequence 2a was 13 residues 
long indicating that the sequence is in frame. In 
addition, this sequence differed from pilCl by four 
basepair substitutions outside the G tract, including an 

25 AAA lysine codon four triplets downstream of the putative 
signal peptide processing site which is in agreement with 
the lysine residue found in the fourth position of the 
gel purified 110 kd PilG protein. Sequences la and lb 
contained CAA, the codon for Gin, at the same position. 

30 Sequence 2b was identical to 2a except for the presence 
of 12 G residues in the G tract. These data are 
compatible with sequence 1 being from pllCl and sequence 
2 from pilC2 and further support that p±lC2 must be the 
expressed locus in the MSll (P"*") variant we are studying. 
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S train UMOl apparently only contains one copy 
of pile. DNA from this strain generated five amplified 
fragments ranging in size from 148 to 152 bp in the PGR 
reaction. The most abundant fragments were 149-151 bp 
5 long. Among ten M13 clones, three variant 1 sequences 
were found (a,b,c) that differed only in the number of G 
residues (11-13) in the G tract (Figure 6) supporting the 
hybridization data that this strain contains only one 
pile gene* Since a PilC protein is expressed from UMOl 
10 we suggest that the majority of cells has 13 Gs in the G 
tract . 

Strain 765 contains two pilC loci, both of 
which seem to be trans la tionally ON based on the presence 
of two high molecular weight proteins reacting with the 

15 absorbed PilC antiserum. A number of amplified fragments 
were seen after the PGR reaction ranging in size from 149 
to 153 bases. Three variant sequences were found among 
nine clones (Figure 6) . The G tract was 13 residues long 
in variant 3a (in frame) and 14 (out of frame) in variant 

20 3b whereas sequence variant 4 contained 11 G residues in 
the G tract. Variant sequence 4 contained four 
additional nucleotides (-CAGG-) distal to the G tract 
relative to variant sequences 1, 2 and 3, indicating that 
the amplified product with 11 Gs from this variant 

25 sequence is 152 long and out of frame. Two PGR amplified 
products 152 and 153 in length were obtained from strain 
765 suggesting that in frame variants of sequence 4 might 
be present in the DNA prepared from this strain. 

Strain 605103 carries two pilC copies, both of 

30 which seem to be translationally OFF. The amplified 

fragments were 148 and 149 bases in size. Out of eight 
M13 clones only variant la and lb sequences were found, 
with 11. and 12 Gs in the G tract respectively. 
Consequently, we were unable to find an in frame sequence 
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variant from this strain. We do not know if the 5' ends 
of the two pile genes are identical in this strain or if 
one pile gene differed from pilCl in the region 
corresponding to the oligonucleotides used for 
5 amplification. In the latter case we would not expect to 
obtain any amplified products from the second copy. 

The only in frame variant found in DNA 
amplified from N, gonorrhoeae carried 13 Gs in the G 
tract. To see if variants with lOGs arise in products 

10 expressed in E. coll, PGR amplified products were 

generated from pABJ04 purified from coll strain AAIO, 
using the same two oligonucleotide primers as before. 
Out of 12 sequenced clones, two carried 10 Gs in the G 
tract (Figure 6) . The majority of clones (seven) carried 

15 12 Gs as expected. It is likely that the PGR 

amplification products are not representative of the 
original DNA population. However, the distribution of 
variation in the G tract is consistent with a model in 
which only one G residue is gained or lost at one given 

20 event. Since AAIO is recA, frameshift mutations in the G 
tract in E. coll occur independently of the RecA protein. 

N. meningitidis contains two pllC loci 
Southern blot hybridizations using MSll pllCl' specific 

25 probes identified multiple fragments when meningococcal 
genomic DNA is digested with a variety of restriction 
endonucleases . PGR amplification using two 24 -base 
oligonucleotides from the 5' end of MSll pllCl as primers 
yields multiple fragments ranging in size from 148 to 151 

30 bases. DNA sequencing of fragments cloned into phage M13 
identifies two classes of sequences, as in N. 
gonorrhoeae, which differ outside the G- tract. Variation 
occurred within each class with respect to the number of 
G's in the G- tract. Therefore, N, meningitidis must 
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carry two pllC loci, as does N. gonorrhoeae, which should 
also be under the control of translational frame 
shifting. 

Both pile loci are cloned from N. mening-itidis 
5 by generating an EMBL3 library and screening this library 
with pilCl' and pilC2' specific DNA from N. gonorrhoeae 
strain MSll. 

Genomic DNA from meningitidis is partially 
digested with Sau3A and fragments ranging from 9 to 20 kB 

10 are ligated into the lambda EMBL3 vector. Because of the 
packaging constraints of the phage, only those lambdas 
which contain DNA fragments of this size will be packaged 
(i.e,, are viable). The library thus constructed can be 
screened with genomic oligonucleotide or cloned gene 

15 probes following selection in a lysogenic E. coli strain. 
(Frischart, A.M. et al (1983), J. Mol. Biol. 170:827). 
Preferably, full length clones are identified by 
screening for clones hybridizing to both the 5' and 3' 
ends of pilC. If full length clones cannot be obtained 

20 from the EMBL3 library, pilC specific probes may be used 
to screen a plasmid library from the same strains . 

Translational fusion proteins with /3- 
galactosidase may also be screened for in a Xgtll 
library, using /3- galactosidase and PilC specific 

25 antisera in Western immunoblots. )8-galactosidase- PilCl 
and )S-galactosidase-PilC2 fusion proteins are purified 
from the cytoplasm of recombinant E. coli and used to 
raise specific antisera. 

30 Example 7 

The immunobioloaical protpert ies of PilC 
PilC is located in the outer membrane of 
Neisseria, The immune response during natural infection 
can be assessed by screening convalescent sera for anti- 
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PilC antibody- The presence of only two pllC loci 
suggests that Pile is only moderately variable, however. 
This together with its essential role in pilus biogenesis 
makes PilC attractive as a potential vaccine candidate. 
5 Two types of PilC translational fusions using 

alkaline phosphatase and jff-galactosidase are generated. 
In the first instance a secreted fusion protein is 
obtained that may associate with the outer membrane. In 
the second instance the fusion proteins may accumulate in 

10 the cytoplasm as inclusion bodies. The construction 

schemes for such fusion proteins uses techniques known in 
the art. TnphoA insertions on plasmid pABiJ04 in E. coll 
are generated, and a PhoA"*" phenotype is screened for as 
blue colonies on media containing the chromogenic 

15 substrate XP. If such clones have the phoA gene in frame 
with an in frame variant of pilCl the fusion product 
should be able to cross the cytoplasmic membrane where it 
can be analyzed by Western immunoblots -using an alkaline 
phosphatase specific antiserum and our PilC antiserum 

20 raised against gel-purified PilC2 from MSlKP***). 

ZracZ: zpllC fusions are generated by cloning different 
segments of pllC into a lacZ' containing vector used to 
generate translational fusions. Similar constructs are 
performed on each of the two pilC genes from N. 

25 menincfltzldls . Antisera are generated against fusion 
proteins after their purification using conventional 
protocols. These antisera are extensively adsorbed with 
extracts of E, coll expressing alkaline phosphatase and 
/S-galactosidase, and used in Western immunoblots and 

3 0 EIiISA assays against a panel of Neisseria, gonorrhoeae and 
Neisseria menlng-ltldls strains. Antisera raised against 
fusion proteins carrying the major portion of PilC are 
also analyzed in Western blots using E. coll expressing 
fusion proteins containing only smaller regions of PilC. 
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The results of these studies should show which regions in 
Pile are immunodominant. 

The pllCl and p±lC2 genes are highly homologous in their 
5' ends whereas the homology is considerably less 
5 pronounced in the central and 3' region. 

In addition, the entire pllC2 gene from N. 
gonorrhoeae MS11(P'*"} is cloned and sequenced. Algorithms 
are used to search for potential T-cell epitopes 
(amphipathic helical conformation) and /S-cell epitopes. 
10 Polypeptides containing the predicted epitopes are tested 
to determine if they can prime mice for an enhanced 
immune response to PilCl and PilC2 . 

Specific Pile antisera are used in 
immunoelectromicroscopy with piliated Neisseria cells as 
15 well as with purified pili to see if PilC is physically 
connected with the pilus fiber. 

Neisseria is grown in the presence of different 
dilutions of PilC" specific antibodies. Bactericidal 
effect exerted by the antiserxim, effects on piliation, 
20 and effects on bacterial attachment to corneal primary 
culture cells are monitored. Binding assays to 
epithelial cells are described in Tjia, K.F. et al. 
(1988), Graefe's Arch. Clin. Exp. Opthalmol. 226:341- 
345. 

25 

Example 8 

Identification and characterization of genes 
located adjacent to pilC 

30 The pilCl and pilC2 loci are part of a larger duplication 
that extends both 5'- and 3'- of pilC. We know from our 
work with E. coli that strains may contain multiple gene 
clusters for the same class of pili. In one case we have 
shown that the only difference between two duplicated 
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gene clusters (pap and prs) resides in the adhesion genes 
such that each cluster gives rise to serologically 
identical pili binding to different cell surface 
receptors . 

5 mTncm mutagenesis in the region upstream and 

downstream of pllCl is performed to generate allelic 
replacements in the pllCl and p±lC2 regions on the 
chromosome. Since the two regions are highly homologous 
we expect to obtain for each insertion allelic 

10 replacements in either region. Double mutants are 
generated as before by isolating DNA from mutants 
carrying insertions in the p±lC2 region transforming P"^ 
variants carrying the same insertion in the pllCl region 
and select for transf ormants resistant to BO/ig/ml of 

15 chloramphenicol. These double mutants are examined for 
piliation, pilins expression, and binding to corneal 
primary culture cells . 

Example 9 

20 Phase variation in gonococcal pili expr ession 

can be caused by frameshift mutati ons in oilC 
If Pile is required for pilus formation, we 
would expect some P~ progeny arising from a P*^ clone to 
accumulate unassembled pilin in the absence of PilC. 

25 Nonpiliated (P") colonies were derived from MSllj^CP"*") , 

restreaked, and tested for the presence of PilC and pilin 
in immunoblots with the PilC and pili antisera. Five out 
of eight P~ clones did not produce detectable levels of 
PilC, but expressed the pilin subunit. The remaining 

30 three P" clones expressed PilC but not pilin. The 

molecular mass of the pilin subunit was the same in the 
P", PilC" variants as in MSllj^CP"*", PilC"^) . However, the 
former in addition produced a protein reacting with the 
pili antiserum that was 16 kd in size. Since MSll^^ only 
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contains one expression site for pilin we believe that 
this protein species represents a proteolytic degradation 
product of the pilin and may be identical to the S -pilin 
previously described (Haas et al., 1987). Several 
5 independent P" clones were isolated from one P", PilC 
clone. They all remained PilC" and retained expression 
of pilin. Piliated (P"*") revertants were also obtained 
from the same P", PilC" clone. These P'*" revertants 
occurred at about a tenfold lower frequency (10 ) than 

10 P' derivatives from a P"*" clone. All P*^ revertants from a 
P", PilC" clone had regained expression of PilC. All but 
one expressed a pilin with the same molecular weight as 
the nonpiliated parent. However, the low molecular 
weight pilin degradation product was much less abundant 

15 in the P"*", PilC revertants. It was possible to obtain 
P"*" revertants from other P", PilC" clones as well, all of 
which expressed Pile. 

The pilE gene from one set of PilC switches was 
PCR amplified and sequenced directly. The P", PilC, 

20 pilin"^ variant 8 carried eight amino acid changes in the 
pilin relative to the parental clone MSll^^^ The pilin 
sequence of the P"^, PilC" backswitcher 8:1 was identical 
to variant 8. Thus, the backswitching from P"to P"*" 
colony morphology was not associated with any alteration 

25 in the pilus subunit protein implying that the change in 
colonial morphology was due to the switch in PilC 
expression. 

Strain MSllj^{P"), variants 8(P") and 8:1 (P^) 
were also examined by transmission electron microscopy. 
3 0 Electron microscopy was performed with a JEOL lOOCX 

microscope with 200 -mesh copper grids coated with thin 
films of 2% Formvar. The bacterial colonies were 
carefully overlaid with buffer [10 /xM Tris-HCl (pH 7,5), 
10 /xM magnesium chloride] and the cells were allowed to 
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sediment for 15 min on a grid. The grids were washed 
with water, negatively stained with 1% sodium 
silicotungstate (pH 7.0) and then washed again. 

The electron micrographs showed that the 
5 MSIKP"^) parental cells were heavily piliated and pili 
were often seen to aggregate. In contrast most cells of 
variant 8{P") were nonpiliated. One or two pili were 
found on --10% of these cells. All cells of variant 8:1 
(P"*") were piliated, carrying -10-40 fibers/cells. No 

10 aggregation of individual fibers was seen. These data 

confirm that the observed changes in colonial morphology 
reflect alterations in expression of pili. Therefore, 
phase variation of gonococcal pili may not only be caused 
by recombination events occurring in the pilE locus 

15 (Bergstrom et al, .1986; Swanson et al., 1986) but also by 
frameshif t mutations in pllC. 

EX^TTIpl ^ 10 

Immunoaenicity of PilC 
20 In order to predict a region of PilC which 

would have a high probability for antigenicity, residues 
300 to 700 of the putative PilCl protein encoded within 
pllCl were analyzed for antigen index, hydrophilicity, 
and hydrophobicity using standard computer-modelling 
25 methods. The analysis indicated that the PilCl 

polypeptide fragment containing residues 300 to 700 would 
have several regions with a high antigen index, high 
hydrophilicity, and a high likelihood for location in an 
external domain. 
30 The immunogenicity of a recombinant polypeptide 

expressed from the DNA encoding amino acids 300 to 700 
was examined. The region of DNA encoding amino acid 
residues 3 00 to 700 was amplified by polymerase chain 
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reaction (PGR) using the following oligonucleotide 
primers . 

5' GGC TAG GTG GCA TAT GAA AGA TAG GGG 3' 
5 and 

5' TTT GCA ATG GGG GAT CCT * G*A*G GTG TGT TTG 3' 

These primers incorporate an Ndel and a BamHI restriction 
endonuclease site (indicated by the underlined 

10 nucleotides) , respectively. A termination codon 

(indicated by the asterisks) was also incorporated. The 
PGR amplified DNA was then ligated into the vector pET3a 
(between the Ndel and BamHI sites) , The recombinant 
vector pET3a is used in the inducible expression system 

15 described by Studier et al. (1990), using the protocol., 
described therein. Strain BL21 (DE3) was transformed 
with the pET3a-pilC (300-700) vector, and the transformed 
strain used for the expression of the PilC (300-700) 
peptide. 

20 The expression products after induction were 

analyzed by sodium dodecyl sulf ate-polyacrylamide gel 
electrophoresis (PAGE) using an 8% acrylamide gel and the 
standard 25 mM Tris base / 250 mM glycine / pH 8.3 / 0.1% 
sodixim dodecyl sulfate (SDS) electrophoresis buffer. 

25 After electrophoresis, the gel was fixed and then stained 
with Coomassie Blue according to standard protocols, and 
the production of PilC (300- 700) was confirmed by the 
detection of the presence in the gels of an abundant, 
appropriately- sized peptide of approximately 46kD. 

30 In order to detect the immunogenicity of the 

PilC (300-700) product, the region of the SDS-PAGE gel 
containing the PilC (300-700) polypeptide was excised from 
parallel unstained lanes, homogenized, and the protein 
eluted into a buffer of 0.1% Triton X-100 in water by 
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passive diffusion. Rabbits were given a priming 
intradermal injection of homogenized gel slices 
(containing approximately 500/xg PilC (300-700) protein), 
followed 3 weeks later by a subcutaneous boost (of 
5 approximately 500 fig of eluted protein) . An initial test 
serum was then collected after an additional 14 days. 
All of the test animals yielded a specific high titer 
antibody response. The antibodies induced by PilC (300- 
700) were immunologically reactive not only with that 

10 polypeptide (i.e., PilCl (300-700) ) , but also with native 
PilCl and native PilC2, 

The results demonstrate, inter alia, the 
following. PilC contains antigenic epitopes that can 
elicit a strong immunogenic response. At least some of 

15 the immunogenic epitopes are shared (cross -reactive) 

between PilCl and PilC2, despite differences in primary 
amino acid sequence. The technique of subcloning 
discrete portions of the PilC protein under control of an 
inducible promoter allows mapping of antigenic epitopes. 

20 Sufficient quantities of specific oligopeptides of known 
antigenicity can be produced for use in screening the in 
vivo immune response after exposure to the intact 
pathogen. 

The following listed materials are on deposit 
25 under the terms of the Budapest Treaty with the American 
Type Culture Collection (ATCC) , 12301 Parklawn Dr., 
Rockville, Maryland 20852, and have been assigned the 
following Accession Numbers. 

30 Description ATCC No. Deposit Date 

pABJ03 in coli (DH5) 68519 Jan. 28, 1991 

pABiJ04 in E, coli (DH5) 68520 Jan. 28, 1991 
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Upon allowance and issuance of this application 
as a United States Patent, all restriction on 
availability of these deposits will be irrevocably 
removed; and access to the designated deposits will be 
5 available during pendency of the above-nauned application 
to one determined by the Commissioner to be entitled 
thereto under 37 CFR 1.14 and 35 USC 1.22. Moreover, the 
designated deposits will be maintained for a period of 
thirty (30) years from the date of deposit, or for five 

10 (5) years after the last request for the deposit; or for 
the enforceable life of the U.S. patent, whichever is 
longer. The deposited materials mentioned herein are 
intended for convenience only, and are not required to 
practice the present invention in view of the 

15 descriptions herein, and in addition these materials are 
incorporated herein by reference. 

Industrial Applicability 
The invention, in the various manifestations 

20 disclosed herein, has many industrial uses, some of which 
are the following. The pilC DNAs may be used for the 
design of probes for the detection of pllC nucleic acids 
in samples. The probes derived from the DNAs may be used 
to detect pilC nucleic acids in, for example, chemical 

25 synthetic reactions. The polynucleotide probes are also 
useful in detecting viral nucleic acids in humans, and 
thus, may serve as a basis for diagnosis of pathogenic 
microorganisms containing type 4 pilin, for example, 
gonococcal and/or meningococcal infections in humans. 

30 In addition to the above, the DNAs provided 

herein provide information and a means for synthesizing 
polypeptides containing epitopes of PilC. These 
polypeptides are useful in detecting antibodies to PilC 
antigens . A series of immunoassays the relevant 
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neisserial infection, based on recombinant polypeptides 
containing p±lC epitopes are described herein, and will 
find commercial use in diagnosing diseases caused by 
these microorganisms. In addition, the polypeptides 
5 derived from the pilC DNAs disclosed herein will have 

utility as vaccines for treatment of infections caused by 
meningococci and gonococci. 

The polypeptides derived from the pllC DNAs, 
besides the above stated uses, are also useful for 

10 raising anti-PilC antibodies. Thus, they may be used in 
vaccines against the relevant microorganisms. Moreover, 
the antibodies produced as a result of immunization with 
the polypeptides containing an immunoreactive PilC 
epitope are also useful as passive vaccines, or in the 

15 detection of the presence of PilC antigens in samples . 
Thus, they may be used to assay the production of 
polypeptides derived from PilC in chemical systems. The 
anti-PilC antibodies may also be used to monitor the 
efficacy of anti-neisserial agents in screening programs 

20 where these agents are tested in tissue culture systems. 
Another important use for anti-PilC antibodies is in 
affinity chromatography for the purification of PilC 
derived polypeptides. The purified PilC polypeptide 
preparations may be used in vaccines. 

25 For convenience, the anti-PilC antibodies and 

polypeptides containing regions encoded in pilC, whether 
natural or recombinant, may be packaged into kits. 
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CliAIMS 

1. A recombinant polynucleotide encoding a 
polypeptide comprised of an immunoreactive epitope of a 
protein encoded in pilC of Neisseria, 

2. The recombinant polynucleotide of claim 1, 
wherein the immunoreactive epitope is encoded in pilC of 
Neisseria gonorrheae. 

3. The recombinant polynucleotide of claim 2, 
wherein the immunoreactive epitope is encoded in pilC2 of 
Neisseria gonorrheae. 

15 4. The recombinant polynucleotide of claim 2, 

wherein the immunoreactive epitope is encoded in pilCl of 
Neisseria gonorrheae . 

5 . A vector comprised of a recombinant 

2 0 polynucleotide, wherein the recombinant polynucleotide is 
selected from the group consisting of the recombinant 
polynucleotide of claim 1, the recombinant polynucleotide 
of claim 2, the recombinant polynucleotide of claim 3, 
and the recombinant polynucleotide of claim 4. 

25 

6. A host cell transformed with the vector of 

claim 5 • 

7. A recombinant expression system comprising 
30 a polynucleotide encoding a polypeptide comprised of an 

immunoreactive epitope of a protein encoded in pilC of 
Neisseria, wherein the polynucleotide is operably linked 
to a control sequence compatible with a desired host. 
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8. The recombinant: expression system of claim 

7, wherein the immunoreactive epitope is encoded in pilC 
of Neisseria gonorrheae. 

5 9. The recombinant expression system of claim 

8, wherein the immunoreactive epitope is encoded in pilCl 
of Neisseria gonorrheae . 

10 • The recombinant expression system of claim 
10 8, wherein the immunoreactive epitope is encoded in pilC2 
of Neisseria gonorrheae, 

11. A cell transformed with a recombinant 
expression system, wherein the expression system is 
15 selected from the recombinant expression system of claim 
7, the recombinant expression system of claim 8, the 
recombinant expression system of claim 9, and the 
recombinant expression system of claim 10. 

20 12. A polypeptide produced by the cell of 

claim 11. 

13. A purified polypeptide contprised of an 
immunoreactive epitope of a protein encoded in pilC of 

25 Neisseria. 

14. A purified polypeptide according to claim 
13, wherein the immunoreactive epitope is encoded in pilC 
of Neisseria gonorrheae. 



30 



15. A purified polypeptide according to claim 
14, wherein the immunoreactive epitope is encoded in 
pilC2 of Neisseria gonorrheae. 
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16. A purified polypeptide according to claim 
14, wherein the immunoreactive epitope is encoded in 
pilCl of Neisseria gonorrheae. 

5 17. A recombinant polypeptide comprised of an 

immunoreactive epitope of a protein encoded in pllC of 
Neisseria. 

18. A recombinant polypeptide according to 
10 claim 17, wherein the immunoreactive epitope is encoded 

in pllC of Neisseria gonorrheae, 

19 . A recombinant polypeptide according to 
claim 18, wherein the immunoreactive epitope is encoded 

15 in pllC2 of Neisseria gonorrheae, 

20. A recombinant polypeptide according to 
claim 18, wherein the immunoreactive epitope is encoded 
in pllCl of Neisseria gonorrheae, 

20 

21. A method of preparing a recombinant 
polypeptide comprised of an immunoreactive epitope of a 
protein encoded in pllC of Neisseria, the method 
comprising: 

25 a. providing a host cell according to claim 

11; 

b. incubating the cell under conditions which 
allow expression of the recombinant polypeptide; and 

c. isolating the polypeptide. 



30 
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22. A vaccine composition for the treatment of 
Neisseria Infection, comprised of a pharmaceutically 
acceptable excipient and of an effective amount of a 
recombinant polypeptide, wherein the polypeptide is 
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comprised of an immunoreactive epitope of a protein 
encoded in pllC of Neisseria, 

23. The vaccine composition of claim 22, 

5 wherein the immunoreactive epitope is encoded in pllC of 
Neisseria g-onorrheae. 

24. A polypeptide affixed to a solid 
substrate, wherein the polypeptide is selected from the 

10 group consisting of the polypeptide of claim 12, the 

polypeptide of claim 13, the polypeptide of claim 14, the 
polypeptide of claim 15, the polypeptide of claim 16, the 
polypeptide of claim 17, the polypeptide of claim 18, the 
polypeptide of claim 19, and the polypeptide of claim 20. 

15 

25. An immunoassay for detection of anti- 
Nelsserla antibodies comprising: 

(a) providing a sample suspected of containing 
anti --&7eisseria antibodies ; 

20 (b) providing an antigen, wherein the antigen 

is a polypeptide selected from the group consisting of 
the polypeptide of claim 12, the polypeptide of claim 13, 
the polypeptide of claim 14, the polypeptide of claim 15, 
the polypeptide of claim 16, the polypeptide of claim 17, 

25 the polypeptide of claim 18, the polypeptide of claim 19, 
the polypeptide of claim 20; and 

(c) incubating the sample of (a) with the 
antigen of (b) under conditions which allow the formation 
of antibody- antigen complexes; and 

30 (d) detecting the presence of anti- -Neisseria 

antibody -antigen complexes formed in (c) , if any. 

26. A composition comprised of a polypeptide, 
wherein the polypeptide is selected from the group 
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consisting of the polypeptide of claim 17, the 
polypeptide of claim 18, the polypeptide of claim 19, and 
the polypeptide of claim 20. 

5 27, A composition comprised of purified 

polyclonal anti-PilC antibodies, wherein the PilC is of 
Neisseria. 

28. A composition comprised of a monoclonal 
10 antibody directed against an immunoreactive epitope 

encoded in pilC of Neisseria. 

29. An immunoassay for detection of an antigen 
encoded in pilC of Neisseria comprising: 

15 (a) providing a sample suspected of containing 

an antigen encoded in pilC of Neisseria; 

(b) providing a composition comprised of 
antibodies directed against the antigen encoded in pilC 
of Neisseria, wherein the composition is selected from 

20 the group of compositions of claim 27 and claim 28; 

(c) reacting the sample of (a) with the 
antibody containing composition of (b) under conditions 
which allow the formation of anti-PilC antibody- antigen 
complexes ; 

25 (d) detecting anti-PilC antibody- antigen 

complexes formed in (c) , if any. 

30. A kit for analyzing samples for the 
presence of anti-PilC antibodies comprising: 

3 0 (a) an antigen packaged in a suitable 

container, wherein the antigen is a polypeptide selected 
from the group consisting of the polypeptide of claim 12, 
the polypeptide of claim 13, the polypeptide of claim 14, 
the polypeptide of claim 15, the polypeptide of claim 16, 
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the polypeptide of claim 17, the polypeptide of claim 18, 
the polypeptide of claim 19, and the polypeptide of claim 
20; 

(b) a buffer used in the performance of the 
5 analysis, packaged in a suitable container; and 

(c) instructions on the performance of the 
analysis which uses the antigen of (a) and the buffer of 
(b) . 

10 31* A kit for analyzing samples for the 

presence of an antigen comprised of an immunoreactive 
epitope encoded in pilC of Neisseria, comprising: 

(a) a composition comprised of antibodies 
directed against the antigen comprised of an 

15 immunoreactive epitope encoded in pilC of Neisseria, 
wherein the composition is selected from the group of 
compositions of claim 27 and claim 28, wherein the 
composition is packaged in a suitable container; 

(b) a buffer used in the performance of the 
20 analysis, packaged in a suitable container; and 

(c) instructions for performing the analysis. 

32. A method for producing antibodies to PilC 
of Neisseria comprising administering to an individual a 
25 composition comprised of an isolated immunogenic 

polypeptide containing a PilC epitope in an amount 
sufficient to produce an immune response to the PilC 
epitope. 

30 33. An oligomer capable of .hybridizing to a 

sequence in pile of Neisseria, wherein the oligomer is 
comprised of a pilC sequence complementary to at least 
about 6 contiguous nucleotides of pilC. 
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34. An oligomer according to claim 33, wherein 
pile is of Neisseria gonorrheae . 

35. A process for detecting a pilC sequence in 
5 an analyte strand, wherein the pilC sequence comprises a 

selected target region, the process comprising: 

(a) providing a sample comprised of an analyte 
strand suspected of containing a selected target pilC 
sequence; 

10 (b) providing an oligomer capable of 

hybridizing to the target pilC sequence, wherein the 
oligomer is comprised of a pilC targeting sequence 
complementary to at least about 6 contiguous nucleotides 
of pilC; 

15 (c) incubating the sample of (a) with the 

oligomer of (b) under conditions which allow specific 
hybrid duplexes to form between the targeting sequence., 
and the target sequence; and 

(d) detecting hybrids formed between the 

2 0 target sequence, if any, and the oligomer. 

36. The process of claim 35 which further 

comprises : 

(a) providing a set of oligomers which are 

25 primers for a polymerase chain reaction (PGR) method and 
which flank the target region; and 

(b) amplifying the target region via the PGR 

method. 

3 0 3 7. A kit for detecting a p±lC sequence in an 

analyte strand comprising: 

(a) the oligomer of claim 33, packaged in a 
suitable container; 
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(b) a suitable buffer/ packaged in a suitable 



container ; and 



(c) instructions for performing the detection. 



5 



38. A recombinant polynucleotide comprising a 



DNA sequence of at least 8 contiguous nucleotides from 
pile, wherein the pllC sequence is selected from the 
group of sequences shown in Figure 3, Figure 6, and 
Figure 7. 



10 



39. A method of treating an individual for a 



N&isseria infection comprising administering to the 
individual antibodies produced according to claim 32, 
wherein the antibodies are administered in an amount 
15 effective to prevjsnt the pathology of the infection. . 

40. An immunoassay for detection of anti- 
NBissBria antibodies comprising: 

(a) providing a sample suspected of containing 
20 anti-Neisseria antibodies; 

(b) providing an antigen^ wherein the antigen 
is the polypeptide of claim 24; 

(c) incubating the sample of (a) with the 
antigen of (b) under conditions which allow the formation 

25 of antibody-antigen complexes; and 

(d) detecting the presence of anti-Z^eisseria 
antibody-antigen complexes formed in (c) , if any. 



30 



41. A kit for analyzing samples for the 
presence of anti-PilC antibodies comprising: 

(a) an antigen packaged in a suitable 



container, wherein the antigen is a polypeptide from 
claim 24 ; 
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(b) a buffer used in the performance of the 
anaylsis, packaged in a suitable container; and 

(c) instructions on the performance of the 
analysis which uses the antigen of (a) and the buffer of 

5 (b) . 
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10 30 50 

GATCCGCCCGSTGCTTGGGGCCCTTAGGGAACCGTTCCCTTTGAGCCGGGOCGGGGCAAC 

70 90 110 

GCGTACCGaTTTTTGTTAATCCGCTATAAAAGGCGGGCXATAGGGTAGGCTTCA.TCCTG 

130 ISO 170 

CCAATCTCACTGAATCCOTCAATTTCCGCAATTCAATTAAATACCQTCAAACCGATGCCG 

190 210 230 

TCATTCCGCGCAGGCGGGAATCCGGACCGGTCGGGCATCTGCGGCGGTTTGCTAAAAAAC 

250 270 290 

GCTTTACCGTGATAAGTGCGCAAAGTTAAAATGGGGAQGTAAGCTTTTCAATCAGCAATC 

310 330 350 

CGOCGGGCGCGGAATCGGGCGGTTTACCGAACCCCGGCGTTCGCGGCGCCCGTCCCGCGA 

370 390 410 

AaOCAAXCTTAAGGAATAAAATATOAATAAAACTTTGAAACGGCAGGTTTTCCGCCATAC 

430 450 470 

CGCGCTTTATGCCGCCATCTTGAXGTTTTCCCATACCGGCGGGGGGGGGGGGCGATGGCG 

490 510 530 

CAAACCCATCAATACGCTATTATCAXGAACGAGCGAAACCAGCCCGAGGTAAAGCAGAAT 

550 570 590 

GTOCCXTCTTCXXTAAAGGACAAAGACAGGAGOCGCGAAXATACTTATTATACGCACAGA 

610 630 650 

ACAGGAGSAGGCXCTCXCTCAXXCAACAAXAACGAXACCCXXGXXXCCCAACAAAGCGGT 

670 690 * 710 

ACXGCCGTTTXTGGCACAGCCACCTACCXGCCGCCCXACGGCAAGGXTTCCGGXTTTGAT 

730 750 770 

GCCGTCGCXCTSAAAGAGCGCAACAATGCCGXXGAXXGGAXXCGXACCACCCGCATCGCG 

790 eiO S30 

CXOGCAGGCXACXCCXACAXCGACGXCAXAXGCAGAAGCTACACAGGCXGXCCCAAACXT 

850 870 690 

GXCXAXAAAACCCGAXXTACCXXCGGXCAACAAG6GXXGAAAAGAAAGGCAGGCAGCAAG 

910 930 950 

CXOGAXAXAXACGAAGACAAAAOCCOCGAAAAXXCaCCCAXXTACAAAXXQICGOAXXAX 

970 990 1010 

CCXXGGXXGGGCGTATCTTTCAAXTXaOaCAGCGAGAAXACCGXCCAAAAXAGCAAATXA 

1030 1050 1070 

TXCAACAAATTGAXAXCTTCXXXXAGAGAAGGCAAXAAXAAXCAAACCAXCGXCTCTACG 

1090 1110 1130 

ACAOAAGGCAACCCXATXTCCCTXGGCGACCGGCAGCGCGAACAXACCGCCaTGGCCTAT 

1150 1170 1X90 

TAXCXGAACGCCAAACXGCACCTGCXGGACAAAAAAGGGAXXGAAOAXAXCGCCCAAGGC 

1210 1230 3ri50 

AAAAXAGTGGAXXXGGGTAXCXXGAAACCGCACGXCGAGACGACAGGAC6AAGCTTGCTA 

1270 1290 1310 

GAXXXXXGGGCXAGGXGGGACAXXAAAGAXACCGGGCAGAXXCCGGXCAAGCXCGGCCXG 

1330 1350 1370 

CCGCAAGXCAAAGCAGGCCGCXGCACCAACAAACCGAACCCCAATAAXAAXACCAAAGCC 

1390 1410 1430 

CCTXCGCCGGCACXGACCGCCCCCGCGCXGXGGXXCGGACCCGGGCAAGAXGGXAAGGCG 

1450 1470 1490 

GAGAXGXAXTCCGCXXCGGXXXCCACCXACCCCGACAGXXCGAGCAGCCGCAXCXXCCTC 

1510 1530 1550 

CAAGACCXGAAAACXCAAACCGAACCCaaCAAACCCGGCCGCXATTCCCXCAAATCXXTG 

1570 1590 1610 

AAXOAXGGXQAGAXXAAAAGXCGACAGCCGAGXXXCAACGGGCGGCAAACAAXCATCCGA 

1630 1650 1670 

XXGGATOACGOCCXACAXXXGAXCAAACTGAAXGGAAGCAAGGAXGAGGXCGCCGCXTTr 

1690 1710 3730 

GXCAAXTXAAAXGGAAACAACACCGGCAAAAACGACACXTTCGGCAITGXTAAGGAAGCG 

1750 1770 1790 

AACQTCAAXCTTGACGCCGACGAGXGGAAAAAAQXGCXGCXGCCXXGGACGGTTCGGGGX 

laiO 1B30 1850 

CCCGAXAAXGACAAXAAAXXXAAAXCAAXXAACCAAAAACCAGAAAAAXACAGCCAAAGA 

1870 1890 1910 
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TACCSCATCCGCGACAACJJlCCQCAATCGCSATTTGGGCGACATCCnrCAilCATC 

1930 1950 1970 

GTCQCaaTCGaCGGGTXTrrTGGCAACCGCCGCGAHCOACQGGATGGTaCATATCTTCXAA 

1990 2010 2030 

AAAAACGGCGGCAaTGATGAACGCAGCTACAATCTGAAGCTCAGCTACATCCCCGGCACG 

2050 2070 2090 

ATGCCSCGCAAGGATATTCAAAGCCAAGAATCCACCCTTGCCAAASAGCTGCGCGCCTTT 

2110 2130 2150 

GCCGAAAAAGGCTATGTGGGCGACCGCTACGGCGXaGACGQCGGCTTTGTCTTGCGCCAA 

2170 2190 2210 

GTCGAACTGAOCGGGCAAAAACACGTOTTTATGTTCGGCGCGATGGOTTTTGGCGGCAGG 

2230 2250 2270 

GGCGCGTATCCCTTGGATTTAAGCAAAATCAACGGAAATTArrCCGGCCGCCGCCCCCCTG 

2290 23X0 2330 

TTTGATGTCAAAGATGGCGATAATAACGGCAAAAATCGCGTGAAAGTGGAATTAGGCTAC 

23S0 2370 2390 

ACCGTCGGTACGCCGCAAATCGGCAAAATCCGCAACGGCAAATACGCCGCCTTCCTCGCC 

2410 2430 2450 

TCCOGTTATGCGGCTAAAAAAATTGACGACTCAACAAATAAAACCOCGCTOTATG'rATAT 

2470 2490 2S10 

QATTTGAAAGACACCTTAGGTACGCCGATTGCAAAAATCGAAGTQAAOGACGGCAAAGGC 

2530 2550 2570 

GOOCTTTCGTCCCCCACGCTGGTGGATAAAGATTTGGACGGCACGaTCGATATCGCCXAT 

2590 2610 2C30 

QCCGGCaACCQGGGCQGCAATATGTACCGCTTTGATTTGAGCAATTCCGAXTCTAGTAAA 

2650 2€70 2690 

TGGTCTGCAAAGGTTATTTTCGAAGGCGACAAGCCGATTACCTCCGCGCCCQCCGTTTCC 

2710 2730 2750 

CGACTGGCAGACAAACGCGTCGTCATCTrCOGTACGGQCAGCGATTTSACCGAAGATGAT T 

2770 2790 2aiO 

OTACTGAATACOGGCG AACAAT ATATTT ACGGT ATCTTTG ACG ACG ATAAGGGG ACGQTT ^ . 

2830 2850 2870 

AAGGTAACGGTACAAAACGGCACGGCAGGCGGGCTGCTCGAGCAACACCTTACTCAGGAA : 

2890 2310 2930 

AATAAAACATTATTCCTGAACAAGAGATCCGACGGTTCGGGCAGCAAGGGCTGGGCGQTG v 

2950 2970 29f>0 

AAXTTOAGOOAAOGAOAACOCQtTACCGTCAAACCGACCGTGGTATTGCGTACCGCCTTC 

3010 3030 3050 

GTAACCATCCGCAAATATAACGACGGCGGCTGCGGCGCGGAAACCGCCATTTTGGGCATC 

3070 3090 3110 

AATACCQCCGACQGCGGCGCATTCACTCCGAGAAGCGCGCGCCCGATTGTGCCGGATCAC 

3130 3150 3170 

AATTCQGTTGCGCAATATTCCGOCCATAAGACAACCTCCAAAGGCAAATCCATCCCTATA 

3190 3210 3230 

GOTTGTATGGACAAAGACGGTAAAACCGTCTGCCCGAACGGATATGTTTACGACAAGCCG 

3250 3270 3290 

GTTAATGTGCCTTATCTGGATGAAACGGAAACAGACGGATTTTCAACGACGGCGGACGGC 

3310 3330 33S0 

GATGCQGGCGOCAGCGGTATAGACCCCGCCGGCAGGCGTCCCGGCAAAAACAACCGCTGC 

3370 3390 3410 

TTCTCCAAAAAAGGGGTGCGCACCCTGCTGATGAACGATTTGGACAGCTTGGATATTACC 

3430 3450 3470 

OQCCCGATGTGCGGTATCAAACGCTTAAGCTGGCGCGAAGTCTTCTTCTGACCGGCCTGC 

3490 3510 3330 

GCGGCCGGTTTTTCCGCAAATGCCGTCCGAAAGGCCTTCGGACGGCATTTTTTTGCGTTT 

3550 

TTCGGGAGGGGGGCGGCAAATGAAACG 
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10 30 50 

GATCCGCCCGGTGCTTGGGGCCCrTAGGGAACCGTTCCCTTTGAGCCGGGGCGGaCCAAC 

70 90 110 

GCOTACCGGTTTTTGTTAATCCGCTATAAAACGCGGGCTATAGGGTAGGCTTCATUCTa 

130 ISO I'^O 

CCAATCTCACTOAATCCGTCAATTTCCGCAATTCAATTAAATACCCTCAAACCOATGCCG 

190 210 230 

TCATTCCOCGCAGGCGGGAATCCGGACCOGTCGGGCATCtrGCGGCGaTrrGCTAAAAAAC 

250 270 290 

GCTTTACCGTGATAAGXGCGCAAAQTTAAAATGGGGAGGTAAGCTTTTCAATCAGCAATC 

310 330 350 

CGGCGGOCGCGGAATCGGGCGGTTTACCGAACCCCGGCaTTCGCGGCGCCCGTCCCGCGA 

370 390 410 

AGGCAAACTTAAGGAATAAAATATGAATAAAACTTTGAAACGGCAGGTUrTTCCGCCATAC 

KMKTI-KRQVPRHT 
430 450 470 

COCGCTTTATGCCGCCATCTTGATQTTTTCCCATACCGGCQQGGGQGGGGGGGCGATGGU 
Atj^AAI LMrSHTQGaGGAMA 
490 510 530 

GCAAACCCATCAATACQCTATTATCATGAACGAGCGAAACCAGCCCGAGGTAAAGCAGAA 
OTHQYAt rMNBRNQP£VKQN 
550 570 590 

TGTGCCATCTTCAATAAAGGACAAAGACAGGAGGCGCGAATATACTTATTATACGCACAG 
VPS3IKDKDRRRBYTYYTHR 
610 630 650 

AACAGGAQGACGCTCTGTCTCATTCAACAATAACCATACCCTTQTTTCCCAACAAAGCGG 
TGGG$VSrNNNDTI*VSQOSG 
670 690 710 

TACTGCCGTTXTTGGCACAGCCACCTACCTGCCGCCCTACGGCAAGCiXTTCCGGTTTTGA 
TAVrGTATV LPPYGKVSGiru 
730 750 770 

TOCCGTCGCTCTGAAAGAGCGCAACAATGCCGTTGATTGGATTCGTACCACCCGCATCGC 
AVALKBRNNAVDWIRTTRIA 
790 010 830 

GCTGGCAGGCTACTCCTACATCGACGTCATATGCAGAAGCTACACAGGCTGTCCCAAACT 
LAOVSY IDVICRSYTGCPKL 
B50 870 890 

TOTCTATAAAACCCGATXTACCTTCGGTCAACAAGGGXTGAAAAGAAAGGCAQGCAGCAA 
VyKTRrTPG QQGLKRKAGSK 

910 930 950 

GCTCGATATATACGAAGACAAAAGCCGCGAAAATTCGCCCATTTACAAATTGTCGGATTA 
LDIVEDKSRENSPirKLSPY 

970 990 1010 

TCCTTGGTTGGGCGTATCTTTCAATTTGGGCAGCGAOAATACCGTCCAAAATAGCAAATT 
PWIiOVSrNIiGSBNTVQN^KL 
1030 1050 1070 

ATTCAACAAATTGATATCTTCTTTTAGAGAAGGCAATAATAATCAAACCATCGTCTCTAC 
FNKLISarREGNNNQTlVST 
1090 1110 1130 

GACAGAAGGCAACCCTATXrCCCTTGGCGACCGGCAGCGCGAACATACCGCCGTGGCCTA 
TEON?lSIiGDRQREHTAVAY 
1150 1170 1190 

TXAXCTGAACGCCAAACTGCACCXCCXCGACAAAAAAGGGATXGAAGATATCGCCCAAGG 
YliNAKlrHLLiDKKGIB'DIAQG 
1210 1230 1250 

CAAAAXAGXGGATTTGGGTATCTXGAAACCGCACGXCCAGACUACAGGACCAAGCXTGCT 
KIVDLGILKPMVEXXGRSLL 
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1270 1290 1310 

AOATTTTTQQGCTMaTaGGACATTAAAGATACCGGGCAGATTCCaaTCAAGCTCGGCCT 
DFWARWDIKDTGQIPVKLGL 
1330 1350 1370 

GCCGCAAQTCAAAaCAQGCCGCTGCACCAACAAACCGAACCCCAATAATAATACCAAAGC 
PQVKAGRCTMKPNPMNNTKA 
1390 1410 1430 

CCCTTCGCCGGCACTGACCGCCCCCQCGCTGTGGTTCGGACCCGGGCAAGATGGTAAGGC 
PSPALTAPALiWFGPGQDGKA 
1450 1470 1490 

GGAGATGTATTCCGCTTCGGTTTCCACCTACCCCGACAGTTCGAGCAGCCGCATCTTCCT 
EMVSXSVSTYPDSSSSRIFL 
1510 1330 1550 

CCAAGAGCTGAAAACTCAAACCGAACCCGGCAAACCCGGCCOCTATTCCCTCAAATCTTT 
OfiliKTQTEPGKPGRTSLKSL 
1570 15d0 1610 

GAATGATGGTGAGATTAAAAGTCGACAGCCGAGTTTCAACGGGCGGCAAACAATCATCCG 
NDOEIKSRQPSrNGRQTI IR 

1630 1650 1670 

ATTOQAXGACOGCGTJlCATTTQATCAAACTaAATGGAAGCAAGGATaAGGTCGCCGCTTT 
LDD.OVHLIKIiNGSKDBVAAF 
1690 1710 1730 

TGTCAA777AAATGGAAACAACACCGGCAAAAACGACACTT7CGGCATTGTTAAGGAAGC 
VNLNGNN TGKNDTFG tVKEA 
1750 1770 1790 

GAACGTCAATCTTGACGCCGACGAGTGOAAAAAAGTGCTGCTGCCTTGGACGGTTCGGGG 
WVNLOADEWKKVLLPWTVRG 
1810 1830 1050 

TCCCGATAATGACAA7AAATTTAAATCAATTAACCAAAAACCAGAAAAA7ACAGCCAAAG 
PDNDMKrKS rNQKPEKYSQR 
1070 18»0 1910 

A7ACCGCA7CCGCGACAACAACGGCAA7CGCGA777GGGCGACAXCG7CAACAGCCCGAT 
VRIRDNMGNRDIiODIVMSPI 
1930 1950 1970 

7G7C0CG0XCGGCGGG7A777GGCAACCGCC0CGAACaACGGQA7Ga7QCATATC7rCAA 
VAVGGYLA7AANDGMVH 1 ITK 
1990 2010 2030 

AAAAAACOGCaaCAQ70A70AACGCAGC7ACAA7C7GAAGC7CAGC7ACAXCCCCGGCAC 
KMGGSDBRSYNIiKIiSyiPGT 
2050 2070 2090 

aA7GCCGCGCAAGGATA77CAAAGCCAAGAA7CCACCCT7GCCAAAGAGCTGC:GCGCC7T 
MPRKD IQSQESTliAKELRAr 
2110 2130 2190 

X0CCGAAAAAGGC7A7C7GGGCGACCGC7ACGGCG7GGACGGCGGCT77aXC77GCGCCA 
AEKGYVGDRYGVDGGrVLRQ 
2170 2190 2210 

AG7CGAAC7GAGCGGQCAAAAACACG7G7TTA7G77CGGCGCQA7GGG7777GGCGGCAG 
VELSGQKHVFMFGAMGFGGR 
2230 2250 2270 

GGGCGCG7ATGCCTTGGA777AAGCAAAA7CAACGGAAA7XA7CCGGCCGCCGCCCCCC7 
Q A L D h S K l NG N YP AAA P L 

2290 2310 2330 

GT77GATGTCAAAGA7GGCGA7AA7AACGGCAAAAA7CGCG7GAAAG7GGAAT7AGGCTA 
FOVKDGDN N GKNRVKVKX^Gr 
2350 2370 2390 

CACCGTCGCTACGCCGCAAATCGGCAAAATCCGCAACGGCAAA7ACGCCGCCTTCCTCGC 
7VG7PQIGKlRNGKYAArLA 
2410 2430 2450 

C7CCGG7XA7GCCGC7AAAAAAA77GACGAC7CAACAAA7AAAACCGCGC7G7A7G7A7A 
S0 YAAKKIDDSTNK7ALyVY 
2470 2490 2510 

7SAT77GAAAGACACC77AGG7ACGCCGA77GCAAAAA7CGAAG7GAAGGACGGCAAAGG 
DLKDTIiG7P lAKIEVKDGKG 
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f5 

2530 2550 2570 

C03aCTTTCGTCCCCCACGCTGGTGSATAAAGATTTG3ACGGCACOOTCOATATCGCCTA 

GLS 3 LVDKD I^PGTVD l Ar 

2590 261Q 2630 

TGCCaGCGACCGOaOCGGCAATATOTACCQCTTTGATTTGASCAATTCCGATTCTAOTAA 
AGDRGGNMVKrDLSNSDfiSK 

2650 2670 26d0 

ATOOTCTGCAAAGGTTATTTTCGAAGGCCACAAGCCGATTACCTCCGCOCCCGCCGTTTC 

WSAKVirEGDKP ITSAPAVS 
2710 2730 2750 

CCOACTQQCXGACAAACGCGTCGTCATCTTCGGTACGGGCAGCGATTTGACCQAAGATQA 
RLADKRVVJFGTGSDLTEDD 

2770 2790 2810 

TGTACTGAATACGGGCGAACAAXATATTTACGGa'ATCTTTGACGACGATAAGGGGACGGT 
VLNTGtOVlYGXrPOOKQTV 
2830 2830 2870 

TAAGQTAACQGTACAAAACGGCACGGCAGGCGGGCTGCTCGAGCAACACC7TACTCAGGA 
KVTVONOTAOGULBQHLTQC 

2890 2910 2930 

AAATAAAACATTATTCCTGAACAAGAGATCCGACGGTTCGGGCAGCAAGGGCTGGGCQGT 
NKTLF LNKRSDGSOSKGWAV 

2930 2970 2990 

GAAATTQACGGAAGGAGAACGCGTTACCGTCAAACCGACCGTGGTATTGCGTACCGCCTT 
KLREGCRVTVKPTVVIiRTAF 
3010 3030 3050 

CaTAACCATCCCCAAATATAACGACGGCGGCTGCGQCGCGGAAACCCCCATTTTGGGCAT 
VTlRKVNDGGCGAETAItiGI 

3070 3090 3110 

CAATACCGCCGACGGCGGCGCATTGACTCCGAGAAaCGCGCGCCCGATTGTGCCGGATCA 
NTADGGAL TPRSARP IVPDK 
3130 3150 3170 

CAAWCSGTTGCeCAATATTCCGGCCATAAGACAACCTCCAAAGQCAAATCCATCCCTAT 
WSVAQYSGHKTTSKGKSIPI 
3190 3210 3230 

AGOTTGTATOGACAAAGACGGTAAAACCGTCTGCCCGAACGGATAirGTtrTACGACAAGCC 
GCMDKDGKTVCPNGYVYDKP 
3250 32-70 3290 

GGTTAATGTOCGTTATCrOGATGAAACGGAAACAGACGGATTTTCAACGACGGCGGACGG 
VNVRVLDCTBTDGrSTTADG 
3310 3330 3350 

CGATOCGGOCOGCAGCGGTATAQACCCCGCCGGCAGGCGTCCCGGCAAAAACAACCGCTG 
DAGOSQIDPAGRRPGKNNRC 
3370 3390 3410 

CTTCTCCAAAAAAGGGGTGCGCACCCTGCTGATGAACGATTTGQACAQCTTGGATATTAC 
r3KKGVRTIiI,MNDt*DSt*DlT 
3430 3450 3470 

CGGCCCOATGTGCGOTATCAAACGCTTAAGCTGGCGCGAAGTCTTCTTCTGACCGGCCTG 
GPMCGIKRLSWRCVrr * 
3490 3510 3S30 

CGCGGCCGGTTTTTCCGCAAATGCCGTCCGAAAGGCCTTCQGACGGCATTTTTrTGCGTT 
3550 

TTTCGGGAGGGGGGCGGCAAA7GAAACG 
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10 30 50 

GA7CCGCCCGG7GCTTGG<jOCCCT7AQQGAACCGTTCCCTTTQAOCCCKiOaCGGGGCiUlC 

70 90 110 

GCGTACCGGTTTTTGTTAATCCGCTATAAAAGQCGG0CTATAGGGTJU5GCTTCATCCTG 

130 ISO 170 

CCAAXCTCACTGAATCC5TCAATTTCCGCAATTCAATTAAATACCGTCAAACCGATGCCG 

190 210 230 

' TCATTCCGCQCAGGCGGGAATCCGGACCGGTCGG0CATC7GCGGCGGT77GC7AAAAAAC 

250 270 290 

GCTTTACCGTGATAAGTGCGCAAAGTTAAAATGGGGAGGTAAGCTTTTCAATCAGCAATC 

310 330 350 

CaQCGGGCGCGGAATCGGGCQGTTTACCGAACCCCGGCGTTCGCGGCGCCCaTCCCGCGA 

370 390 410 

AGGCAAACTTAAGGAATAAAATATGAATAAAACTTTGAAACGGCAGGTTTTCCGCCATAC 
fcaaui 1 MHKTLKRQVFRHT 

430 450 470 

CGCGCTTTATGCCOCCATCTTGATGTTTT CC CATACCGOCGGGGGGGGGGGGCGATGGCG 
ALYAAtliHFSHTaGGGGRWR 

titmam 2 M A 

490 310 530 

CAAACCCATCAATACGCTATTATCATGAACGAGCGAAACCAGCCCGAGGTAAAGCAGAAT 

KPINTLL6* 
Q THQYAIIMNBRNQPEVKQN 

SSO 570 590 

GTGCCATCTTCAATAAAGGACAAAGACACGAGGCGCGAATATACTTATTATACGCACAGA 
VPSS IKDKDRRREYTYVTHR 

610 630 650 

ACAGGAOSAGGCTCTGTCTCATTCAACAATAACGATACCCTTGTTTCCCAACAAAGCGGT 
TGGOSVSFNNMDTLVSOQfiG 

670 €90 710 

ACTGCCGTTTTTGGCACAGCCACCTACCTGCCGCCCTACGGCAAGGTTTCCGGTTTTGAT 
TAVrGTATYLPPYGKVSGKO 

730 750 770 

GCCOTCGCTCTGAAAGAGCGCAACAATGCCGTTGATTGGATTCGTACCACCCaCATCGCG 
AVALKERNNAVDWIRTTRIA 

790 010 830 

CTOGC AGGCX ACTCCTACATCGACGTCATATGC AGAAGCTACACAGGCTOTCCCAAACTT 
LAGVSYIDVICRSYTGCPKL 

OSO 670 890 

GTCTATAAAACCCGATTTACCTTCGGTCAACAAGGGTTGAAAAGAAAGGCAGGCAGCAAG 
VYKTRrTFGQQGLKRKAGSK 

910 930 950 

CTGGAXATATACGAAGACAAAAGCCGCGAAAATTCGCCCATTTACAAATTGTCGGATTAT 
LDIYEDKSRCNSPIYKLSDY 

970 990 1010 

CCTTGGTTGGGCGTATCTTTCAATTTGGGCAGCGAGAATACCGTCCAAAATAGCAAATTA 
PWLOVSrMLGBBNTVQNSKL 

3 030 1050 1070 

TTCAACAAATTGATATCTTCTTTTAOAGAAGGCAATAATAATCAAACCATCGTCTCTACG 
ri^KIilSSTREGNNNQTIVST 

1090 xiio 1130 

ACAGAAGOCAACCCTATTTCCCTTGGCQACCGGCAQCGCGAACATACCGCCGTQGCCTAT 
TBGNP l S LGDRQRKHTAVA Y 

1150 1170 . 1190 

TATCTGAACOCCAAACTGCACCTCCTCGACAAAAAAQaGATTGAAGATATCGCCCAAGGC 
VLNAKLKliLDKKGICD X AQG 

2-210 3 230 1250 

AAAATAQXGGAXXXGGGXATCTTGAAACCGCACGXCGAGACGACA GACQAAGCTTGCTA 
K IVDLQ I LKPHVETXGRS L li 

1270 1290 1310 

GAXTTTXGQGCTAGGXGGGACATTAAAGATACCGGGCAGATTCCGGTCAAGCTCGGCCTG 
DrWARWDlKDXGQlPVKX.GL 
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1330 1350 1370 

CCGCAAGTCAAAGCAGGCCGCTGCACCAACAAACCGAACCCCAATAATAATACCAAAGCC 

PQVKAQRCTNKPNPNNNTKA 

1390 1410 1430 

CCTTCGCCGGCACTGACCGCCCCCGCGCTGTGGTTCGGACCCGGGCAAGATGGTAAGGCG 
PSPALTAPALWFG PGQDGKA 

1450 1470 
aAGATOTATTCCGCTTCGGTTTCCACCTACCCCGACAGTTCGAGCAGCCGCATCTTCCTC 

EMVSA3VST^PD33SSRIFI- 

1510 1530 1S50 

CAAGAGCTGAAAACTCAAACCGAACCCGGCAAACCCGGCCGCTATTCCCTCAAATCTTTG 
QELKTQTEPGKPGRYSLKSL 

1570 1590 1610 

AArrOATGaTGAGATTAXAAGTCGACAGCCQAOTTTCAACGGGCGGCAAACAATCATCCGA 

NDGEIKSRQPSFNGRQTI IR 

1630 1650 1670 

TTGOATOACOaCaTACATTTGATCAAACTGAAXGGAAGCAAGGATOAQaTCGCCGCTTTT 
LDDGVHiilKIiNGSKDEVAAF 

1690 1710 1730 

QTCAATTTAAATGGAAACAACACCGGCAAAAACCACACXTTCGGCATTGTTAAGGAAGCG 

VNLNGNNTGKNDTFGIVKCA 

1750 1770 1790 

AACGXCAATCTTGACGCCGACGAGTGGAAAAAAGXGCTGCTGCCTTGGACGGTTCGGGGT 
NVNLDADEWKKVIiliPWTVRG 

1810 1830 1850 

CCCGATAAXGACAATAAATTTAAATCAATTAACCAAAAACCAGAAAAATACAGCCAAAGA 
PDNDNKrKSINQKPEKYSQR 

1870 1890 1910 

TACCGCATCCGCGACAACAACGGCAATCGCGAT'rrGGGCGACATCGTCAACAGCCCGATT 
YRlRDKNGNKDluGDlVNSP I 

1930 1950 1970 

GrCGCGGTCOGCaOOTATTTGGCAACCGCCGCGAACaACGGGATGGXGCATATCTTCAAA 

VAVOOVLAXAANDGMVH IFK 

1990 2010 2030 

AAAAACGGCGGCAGXGATGAACGCAGCXACAAXCXGAAGCTCAGCXACAXCCCCGGCACG 
KNGOS DE RSXNLKLSriPGT 

20r>0 2070 2090 

ATOCCGCGCAAGGAXAXXCAAAGCCAAGAAXCCACCCXXGCCAAAaAGCXGCGCGCCTTT 
MPRKDIQSQESTI-AKEIiRAF 

2110 2130 2190 

GCCGAAAAAOOCXAXaXGGQCOACCCCXACGGCaXGGACGGCGGCTXXGXCXXGCGCCAA 

AEKGYVGDRVGVDGGFVLRQ 

2370 2190 2210 

QXCGAACXOAOCGOGCAAAAACACGXGXXTAXGXXCGGCGCGAXGGGXXXXGGCGGCAGG 

VEL 8 GQ KHVFMFGAMG FGGR 

2230 2250 2270 

GGCGCGXAXGCCXXGGAXXXAAGCAAAAXCAACGGAAATTATCCGGCCGCCGCCCCCCTG 
GAYALDliSKINGNYPAAAPb 

2290 2310 2330 

TTTGAXGXCAAAGAXGGCGATAATAACGGCAAAAAXCGCGXGAAAGXGGAAXXAGGCTAC 

FDVKDCDNNQKNRV .KVEL.GY 

2350 2370 2390 

ACCGTCGGXACGCCGCAAAXCGGCAAAAXCCGCAACGGCAAAXACGCCGCCXXCCTCGCC 
TVGTPQIGKIRNGKYAAFLA 

2410 2430 2450 

TCCGGTTAXGCGGCTAAAAAAATTGACGACXCAACAAAXAAAACCGCGCXGXATGTATAT 

5GYAAKKIDD6XNKTALYVY 

2470 2490 2510 

GAXTXGAAAaACACCTXAGGTACGCCGAXXGCAAAAATCGAAGTGAAGGACGGCAAAGGC 
DI^I^DTLQXPIAKIBVKDGKG 

2530 2550 2570 

GGGCTTTCGXCCCCCACGCXGGXGGAXAAAGAXXXGGACGGCACGGXCGAXAXCGCCTAT 
Gli8SPTLVDKPi:*DGXV0IAY 
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2590 2£I0 2G3 

GCCGGCQACCGGCaCOSCAATATQTACCQCTTTOATTTGAGCAATTCCGATTCrACSTAAA 
AGDRGGNMVRFD LSNaDSSK 

2650 2670 2690 

TGGTCTGCAAAGGTTATTTTCGAAaaCQACAAGCCGATTACCTCCGCOCCCGCCGTTTCC 
WSAKVIFEGDKP XT3APAVS 

2710 2730 2750 

CGACTGOCAOACAAACOCaTCGTCXtrCTTCGGTACaGQCAGCGATTTGACCGAAGATGAT 
RLADKRVVIFGTaSDIiTEDD 

2770 2790 2810 

GTACTGAATACGGGCGAACAATATATTTACGGTATCTTTGACaACGATAAGGGGACGGTT 
VliNTOBQV lYOIFDDDKGTV 

2830 28S0 2670 

AAGGTAACOOTACAAAACGGCACGGCAQGCGGaCTGCTCGAGCAACACCTTACTCAGGAA 
KVTVQNGTAQOtiliEQHLTQK 

2890 2810 2830 

AATAAAACATTATTCCTGW^CAAGAGATCCGACGQTTCGGGCAGCAAQGGCTGGGCGGTG 
NKTLri^NKRSDGSOS KGWAV 

28S0 2870 2880 

AAATTGAGGGAAGGAGAACGCGTTACCGTCAAACCGACCGTGGXATTaCQTACCGCCTTC 
KliRBGERVTVKPTVVIiRTAt' 

3010 3030 3050 

QTAACCATCCGCAAATATAACGACGGCGGCTGCGGCGCCMAAACCGCCATTTTGGGCATC 
VTIRKY NPG QCGAETAILGl 

3070 3080 3110 

AATACcaccaAr.ftrtr.finr.rtnATTRAnTr.rx;AriAA(v^rw^GcrK;f:cGATTGTa 

NTAOGQXLTPRSARP IVPDH 

3130 3150 3170 

AATTCGGTTGCGCAATATTCCGCCCATAAGACAACCTCCAAAGGCAAATCCATCCCTATA 

M3VAQYSGHKTT3 KGKSI'P I 

3180 3210 3230 

G0TTGTATQGACAAAGACGG7AAAACCG7CTGCCCGAACGGA'rA*rG7'XTACGACAAGCCG 
GCMDKDGKTVCPNG7 VYPKP 

3250 3270 3280 

G7TAATGTGCG7TArrCTGGA7GAAACGGAAACAGACGGA'r'rTTCAAC;C2ACGGCGGACGGC 
VNVRYLDETSTDGrSTTADG 

3310 3330 3350 

GATGCGGGCGGCAGCGGTATAGACCCCGCCGGCAGGCGTCCCQGCAAAAACAACCGC7GC 
DAOOSOIDPAGR RPGKNN RC 

3370 3390 3410 

TTCTCCAAAAAAGGGGTGCGCACCCTGCTGATGAACGATTTCGACAGCTTGGATATTACC 
rSKKGVRTIiLMNDLDSIiDiT 

3430 3450 3470 

GOCCCGATOTOCOOTATCAAACGCTTAAGCTGGCGCGAAGTCTTCTTCTGACCGGCCTGC 
OPMCGlKRIiSWREVrr * 

3480 3510 3530 

GCOGCCGGTTTTTCCGCAAATGCCGTCCGAAAGGCCTTCGQACGGCATTTTTTTGCGTTT 

3550 

TTCGGGAGGGGGGCGGCAAATGAAACG 
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1 CCOCTGTATGTGTATGATTTGaAAAACACCAGTGCTAGTCTGATTAAAAAAATCGWlGCA 
CCCGGCGGCAAAGGCGGGCTTTCGTCCCCCACGCTGGTGGATAAAGATTTGGACGGCACG 
GTCGATATCGCCTATGCCGGCGACCGGGGCGGCAATATGTACCGCTTTGATTTGAGCAAT 
TCCGATTCTAGTAAATGGTCTGCAAAQGTTATTTTCGAAGGCGACAAGCCQATTACCTCC 
GCGCCCGCCGTTTCCCGACTGOCAGACAAACGCGTGGTTATCTTCGGCACGGGCAGCGAT 
TTGAGTGAACAGGATGTACTGGATACGGACAAACAATATATTTACGQTATCTTTGACGAC 
GATAAGTCGACGGTTAATGTAAAGGTAACAAACGGCACQGGAGGCGGGCTGCTCGAGCAA 
GTOCTTAAAGAGGAAAGTAAAACCTTATTCCTGAGCAATAATAAGGCATCCGGCGGATCG 
GCCGATAAAGGGTGGGTAGTGAAATTGAGGGAAGGAGAACCCGTTACCGTCAAACCGACC 
GTGGTATTGCGTACCGCCTTTGTCACCATCCGCAAATATACGGATACGGACAAATGTGGC 
OCOCAAACCGCCATTTTGGGCATCAATACCGCCGACQGCGGCGCATTGACTCCGAGAAGC 
GCOCGCCCGATTGTGCCGGATCACAATTCGGTTQCGCAATATTCCGGCCATCAGAAAATG 
AACGGCAAGXCCA7CCCGG 73 d 
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PLVVYP LENTSGSIilKKIEA 
1 CCGCTGTATGTGTATGATTTGGAAAACACCAQTGG'rAGTCTGATTAAAAAAATCGAAGCA 

• t » • • • 
PGOKGGLSSPTIrfVDKD LDGT 
CCCGGCOGCAAAGGCQGGCTTTCGTCCCCCACGCTGGTOOATAAAGATTTQGACGGCACG 

• « • ft * . 
VDIAYAGDRGGNMYRFOLSN 
QTCGATATCGCCTATGCCGGCGACCQGGGCGGCAATATGTACCGCTTTGATTTGAGCAAT 

• • • ♦ • • 
SDSSKWSAKVirEGDKPITS 
TCCGATTCTAGTAAATGGTCTOCAAAGGTTATTTTCCAAGGCGACAAGCCGATTACCTCC 

• • t • • • 
APAVSRLADKRVVirQTGSD 
GCaCCCacCGTTTCCCGACTGGCAGACAAACGCaTGGTTATCTTCGGCACGGGCAGCGAT 

• • • t i • 

LfiEQDVLDTDKQYiyaiFDD 
TTGAGTGAACAGGATGTACTGGATACGGACAAACAATA'rA'XTTACGGTATCTTTGACGAC 

• • • • • • 

DKSTVNVKVTMOTOGGLtiEQ 
GATAAGTCGACGGTTAATGTAAAGGTAACAAACGGCACGGGAGGCGGGCTGCTCGAGCAA 

• • • • * • 
VLKCSSKTLT LSNNKASGGS 
QTOCTTAAAGAGGAAAGTAAAACCTTATTCCTGAGCAATAATAAGGCATCCGGCGGATCG 

• • • • • • 
ADKGWVVKIiREGERVTVKPT 
GCCGATAAAGGGTQGGTAGTGAAATTGAGGGAAGGAGAACGCGTTACCGTCAAACCGACC 

• • • , ' • • • 
VVLRTArVTlRKrXDTDKCG 
GTOOTATTOCGTACCGCCTTTGTCACCATCCOCAAATATACGGATACGGACAAATGTGac 

• • • • • • 
AQTAlLGlNTADGGAIiTP RS 
GCGCAAACCGCCATTTTGGGCATCAATACCGCCGACGGCGGCGCATTGACTCCGAGAAGC 

• * • • • 
ARP IVPDHNSVAQYSOHQKM 
GCGCGCCCGATTGTGCCGGATCACAATTCGGTTGCGCAATATTCCGGCCATCAGAAAATG, 

M G K S I P 
AACGGCAAGTCCATCCCGG 739 
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PLYVVDLENTSGSLIKKIEA 
1 CC3CX3TATaT0TATGATTTGGiUUUW:w:CAQT0GTAGTCTGATTAAAAAAATCGAAGCA 

I I I t I I M I I H M H M I n 111(1 (Ml ) (Hi ( t i I I M M 1 ( 
2446 GCGCTaTATGTATATGATTTGAAAGACACCTTAGGTACaCCGATTQCAAAAAXCGAAGXG 
ALYVAOIi2J*nTL<3X2.IKKIEA 

PGGKGGLSSPTLVDKDLDGT 
CCCOOCGOCAAAGGCGGGCTTTCGTCCCCCACGCTCGTGGATAAAGATTTGGACGGCACG 

1 I I I 1 I I f J ) I I 1 I I t I I I I t I I I I II I M I ) It I f 1 M I I I I II I II i H 1 I M I 
AAGGACGGCAAAGQCGGGCTTTCGTCCCCCACGCTGGTGGATAAAGATTTGGACGGCACG 
iiUGKGGLSSPTLVDKDI*DGT 

• • • ■ • • 

VD lAYAGDRGGNMYKFDIiSN 
GTCGATATCGCCTATGCCGCCGACCGGGGCGGCAATATGTACCGCXTTGAXTTGAGCAAT 

I I I It I I I I I II I I M I I I 1 I I It I t I M I I I I II I t I I M t I I I I 1 1 I I I I I I I II I I I 
OXCOAXATCGCCXATGCCGGCGACCGGGGCGGCAATAXGXACCGCXXXGAXXXQAGCAAX 

VD ZAYAGDRGGNMYRrO L3N 



SDSSKWSAKVirCGDKP IXS 
XCCGAXXCTAGXAAAXGGXCXGCAAAGGXXAXXXXCGAAGGCGACAAGCCGAXTACCXCC 

11 I I I I II I I I II 1 I I I I I I t I I I I II M I t! f 1 I I M I I II I II M ) 1 I t 1 n I I M 1 I 

XCCOATXCXAOXAAAXGOXCXGCAAAGGXTAXTTTCGAAGGCGACAAGCCGAXXACCXCC 
SDSSKWSAKVIFEGDKPIXS 



APAV3RXiADKRVVIFGXG$D 
GCGCCCGCCGXXXCCCGACXGGCAGACAAACGCGXGGXXAXCXXCGGCACGGGCAGCGAX 

M t t I M I It 1 I II i t t M t i M I I t I 1 I I t I t M II I 1 I t t I I I I I I I t t 1 t t M I 
GCGCCCGCCGXXXCCCGACXGGCAGACAAACGCGXCGXCAXCXXCGGXACGGGCAGCGAX 
APAVSRXiADKRVVirOXGSD 

• • • • p # 

LSEQDVLDXDKQYIYGIFDD 
TXGAGTGAACAGGAXCXACXGCAXACGGACAAACAAXAXAXXXACGGXAXCXTTGACGAC 

lilt Ml ! MIIINII llllll 1 IMlMllttllllllltnillltllll 
XXOACCGAAGAXGAXGTACXCAAXACGGGCGAACAAXATAXXTACOOXAXCXXXGACGAC: 
LXSO.DVL23TfifiQ^ IXGXrDO 
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DKSXVNVKVTNGTQOGLI-EQ 
GAXAAGXCGACCGXXAAXGXAAAGGXAACAAACGGCACGGGAGGCGGGCXGCXCGAGCAA 
IMIli tlllllMI MM MM IMMMMM IMMMIMIIIMMM 

GAXAAGGGGACGGTTAAOGXAACOOXACAAAACGGCACGGCAGGCGGGCXGCXCGAGCAA 
DKfiXV2SVXVfiNGTAG0I*LBQ 
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GXOCTXAAAGAGGAAAGXAAAACCXTAXTCCXGAQCAAXAAXAAGGCATCCGGCGGATCG 
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CACCTX ACXCAGGAAAAXAAAACAXTATXCCXC AACAAGAGAXCCGACGGXXCG 

HliXiiEIlKXLrL NKasnCS 

ADKGWVV KLREGCRVXVKPX 
GCXGAXAAAGGG-GGGXAGTCAAAXXGAGGGAAGGAGAACQCCXXACCaTCAAACCGACC 
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GGCAGCAAGGGCXGGGCGGTGAAAXTGAGGQAAGGAGAACGCGXXACCGXCAAACCGACC 
GS.KGWAVKLRBGCRVTVKPT 
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VVLRTAFVTIRKVTDTOKCG 
GTOGTATTGCGTACCGCCTTtTGTCJlCCATCCGCAAATATACGGATACGGACAAATGTGGC 

I I 11 i I I 1 It M I M 1 I 1 i I II ilMMUMlinn It t I M IN 
aTGGTATXGCGTACCGCCTTCGTAACCATCCGCAAATATAACGA. . • CGGCGGCTGCGGC 
VVLATAr-VTIRKrnD QQCG 

• • ■ * 

AQTAILGINTADGGALTPRS 
GCOCAAACCGCCATTTTGGGCAXCAATACCGCCGACGGCGGCGCATTGACTCCGAGAAGC 
Ml lltlllllillttlllllllllltllllltlllllllllllltMlliniMllt 
OCOGAAACCGCCATTTTGCGCATCAATACCGCCGACOGCGGCGCATTGACTCCGAGAAGC 
A£TAILGINTADQ0ALTPAS 
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ARPIV?DHNSVAQrSGHQKM 
GC8CGCCCGAXTGTGCCGGATCACAATTCGGTTCCGCAATATTCCGGCCATCAGAAAA. 

I I I II I I I I t 11 M 1 It I M II tl I I M I I I I I II II I M M I I M I I I I I Ml M 
GCGCGCCCGATTGTGCCGGATCACAATTCGGTTGCGCAATAXTCCGGCCATAAGACAAC 

ARPIVPDHNSVAQX3GH1LXX 
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CTCCAAAGGCAAAXCCAXCCCAX 3X76 
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